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Preface 


Development of modern sciences is characterized by an un- 
paralleled dynamism—the fast build-up of experimental 
evidence and the kaleidoscope of hypotheses and theories. 
A modern man during his lifetime witnesses a dramatic 
change of the scientific view of the world. Therefore, an 
account of the evolution of present-day concepts in any of 
the fundamental scientific disciplines of necessity acquires 
the nature of historical study, although the span covered 
may be as short as several decades. Studies of this sort are 
fairly rare: the author of such a work must be an expert of 
very large calibre, but it is only rarely that a major scientist 
would be willing to spend time on a historical study beyond 
the scope of a traditional review of the literature. 

Most of the contributors to this multiauthor book are 
eminent Soviet physicists with significant results to their 
credit in branches as diverse as spectroscopy, atomic and 
chemical physics, astrophysics, radiophysics, the physics of 
plasma and low temperatures, quantum electronics, and 
the physics of nonlinear phenomena. The title of the book 
does not mean to suggest that the articles it contains claim 
at presenting a complete picture of the evolution of modern 
physics as a whole. The emphasis is rather placed on several 
of fundamental scientific areas that, however, reflect the 
nature of the physics of the 20th century. The contributions 
vary widely in format and language, although there is one 
thing common to them—they all are historical reviews of 
the development of respective disciplines of physics. 

The first contribution is an extensive historical analysis 
of!tho development of quantum concepts from the atomistics 
of the 19th century to quantum mechanics, which along 
with the theory of rolativity formed the theoretical foun- 
dation of 20th century physics. The article focuses on the 
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development of the years 1900-25, which in the opinion of 
the author Academician M. A. Elyashevich (Belorussian 
Academy of Sciences), can be divided into two sets of ideas: 
the first is associated with the quantum theory of the atom 
(Bohr-Sommerfeld-Heisenberg) and the second with the 
quantum theory of radiation and the notion of particle- 
wave dualism (Einstein-de Broglie-Schrödinger). The emer- 
gence of quantum theory in 1925-27 and its later evolution 
is covered in less detail. The story of the quantum ideas 
taking shape conveys the atmosphere of scientific quest of 
those heroic days. Closely linked with this work are two 
further articles on the history and the state-of-the-art of 
the atomic structure (D. N. Trifonov, D. Sc. (Chem.)) and 
the discovery and further studies of atom-like structures: 
positronium and mesoatoms (Academician V. I. Goldanskii 
and V. P. Shantarovich, D. Sc. (Phys.-Math.)). The second 
article is a story of the emergence of the present-day views 
of the atom, which formed under the influence of the dis- 
coveries of X-rays, radioactivity, the electron, and inert 
gases. The article further considers how the development 
of these views has led to the modern understanding of the 
structure of atoms and their transformations, and how on 
this basis an insight has been gained into the periodicity 
of atomic structures. 

The third article is a natural continuation of the second 
one. It is concerned with exotic atoms, their discovery, 
and theoretical explanation, and focuses on applications 
of atom-like objects to a number of important problems in 
physics and chemistry. 

The next two articles, although devoted to the state-of- 
the-art of two most important and promising areas of re- 
search (nonlinear phenomena and quantum electronics), are 
in essence historical accounts of the evolution of these 
fields during the last 25 years. The article by V. S. Leto- 
khov, D. Sc. (Phys.-Math.), deals with quantum electron- 
ics and it highlights the historical question of why the 
first to be invented was the maser, not the laser. Both the 
physics of nonlinear phenomena and quantum electronics 
play an important integrative role in physics. Quantum 
electronics, primarily the laser, has given to rescarchers 
a powerful high-quality energy source, an important addi- 
tion to the experimental tool kit of modern physics. Ad- 
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vances in the physics of nonlinear phenomena open up be 
fore theoreticians a new conceptual and mathematica 
approach, a kind of methodology and technique of ‘nonlin- 
ear thinking’, which are due to the Soviet physicist 
L. I. Mandelstam and A. A. Andronov. 

The article by Academician A. V. Gaponov-Grekhov and 
M. I. Rabinovich, D. Sc. (Phys.-Math.), on nonlinear 
physics deals above all with two fundamental problems: 
the emergence of stochasticity in relatively simple deter- 
ministic systems and the emergence of ordered determinis- 
tie structures in nonlinear media. There are some issues that 
are closely linked with such exotic objects as ‘strange at- 
tractors’ and solitons, and they are of direct concern for 
fundamental and practical problems: evolution theory, 
weather forecast, plasma heating, turbulence mechanisms, 
modern gauge theories of elementary particles, and so on. 

The book concludes with a prognostic review of modern 
physics and astrophysics by Academician V. L. Ginzburg. 
This is a piece of futurology that is based on an analysis 
both of the state-of-the-art and of the history of physics. 
The author, known for his contributions to a wide variety 
of domains in physics and astrophysics, made an attempt 
to single out ‘key problems’ in these sciences and to predict 
their state by the turn of the millennium. Their number 
coincides with that of the famous Hilbert problems, which 
were formulated in mathematics in 1900 and have had a 
great influence on the development of physical and math- 
ematical thinking in this century. By the way, the ninth 
and tenth problems in Ginzburg’s list are novel lasers (in- 
cluding rasers and grasers, i.e. X-ray and gamma-ray lasers) 
and nonlinear physics (including the theory of solitons and 
‘strange attractors’). Historians of science will find of 
interest some views of a professional theoretical physicist 
on scientific revolutions in physics and astrophysics and 
his dispute with one of the originators of the concept of 
scientific revolutions, the prominent U.S. science historian 
T. Kuhn. 

Quantum theory and the theory of relativity underlying 
a theoretical understanding of nearly all ‘key problems’ 
came into being as a result of the second (real, according 
to the author) scientific revolution in physics. The first one 
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The ‘quantum’ part of the second scientific revolution is 
described in the opening article of the book. The influences 
of the revolution on the atomistics of the 20th century are 
considered in the article by Trifonov and that by Goldanskii 
and Shantarovich. All the articles are thus closely related and 
in combination they produce the feeling of vehement growth 
of physical sciences in the 20th century. They are readable 
and exciting and, therefore, will appeal to physicists, histo- 
rians and philosophers of science, philosophers, and also 
to all those fascinated with the physics of our days. 

The Editorial Board of the book extend their profound 
gratitude to all those who by their advise and criticism 
contributed to improvement of this book. 


The Editorial Board 


Quantum Theory. 
Origins and Growth 


M. A. Elyashevich 


l. Introduction 


In the history of physics there is a remarkable period—the 
middle and end of the 1920s. It saw the discovery of the 
principal laws of quantum mechanics, quantum statistics 
and the quantum theory of fields. Quantum theory swiftly 
became well established as an independent full-fledged 
section of theoretical physics capable of adequately ex- 
plaining the microscopic phenomena at the atomic and 
molecular level. The birth of quantum theory was a result 
of the complicated development of physics and related 
sciences in the 19th century and the first quarter of the 
20th century. This development was associated with? at- 
omistic views of the structure of matter and was distin- 
guished by two points. First, physicists began to deal with 
microscopic space scales—atomic (10-8 cm) and then sub- 
atomic (10-1? cm)—with energies of elementary processes 
of an order of electronvolts (for typically atomic and mole- 
cular processes, such as ionization, dissociation, chemical 
reactions) and of megaelectronvolts (for typically nuclear 
processes, such as radioactive decay and nuclear reactions). 
Second, it was found that the laws that regulate the be- 
havior of the microcosm differ fundamentally from those 
of the macrocosm. It was the investigation into the nature 
of microscopic phenomena—‘the mysterious phenomena’ 
[4]—that eventually led to the creation of a consistent 
quantum theory, a revolution in the entire natural science, 
which was dominated by physics [2]. 

In the 19th century atomistics developed along different, 
although interrelated, lines: atomic and molecular teach- 
ing in chemistry (chemical atomistics), molecular kinetic 
theory and statistical methods in physics (physical atom- 
istics), and the symmetry of the internal structure of crys- 
tals (atomistics in crystallography). As the century pro- 
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gressed researchers came up against facts that classical 
physics could not explain, and these facts became a source 
of quantum ideas. Views were becoming ever more popular 
that the atom is a complex system and at the turn of this 
century it was already widely held that the atom is an 
entity consisting of some constituent particles bound by 
forces of electromagnetic interaction, with these forces 
determining the properties of the atom. According to 
Rutherford’s model based on the famous alpha-particle scat- 
tering experiment, such particles turned out to be the nu- 
cleus and the electrons interacting by Coulomb’s law. But 
from classical electrodynamics and mechanics the Ruther- 
ford atom must be unstable, which contradicts the observed 
stability of atoms. This contradiction was only removed 
when Bohr in 1913 applied to the atom his quantum con- 
cepts. This laid the foundation of the quantum theory of 
the atom and atomic systems. 

The inception of quantum thinking and its advances 
before 1913 were primarily associated with the need to ex- 
plain thermal equilibrium radiation and its consequences. 
In 1900 Planck used a statistical approach to derive a law 
of radiation (named after him) by introducing the funda- 
mental assumption that the energy of matter is discrete (the 
quantization of the energy of harmonic oscillators—‘reso- 
nators’—that model the matter that is in equilibrium with 
clectromagnetic radiation). In 1905 Einstein applied sta- 
tistics to radiation and suggested his hypothesis of quanta 
of light, or photons—discreteness of the energy of electro- 
magnetic radiation (the quantization of the energy of ra- 
diation, which in another form, for natural oscillations of 
the electromagnetic field in a cavity, was carried out by 
Ehrenfest in 1906 and by Debye in 1910). Einstein was 
successful in giving a quantum explanation for a number 
of processes of interaction of radiation with matter, and 
then he laid the foundation of the quantum theory of heat 
capacities by drawing on Planck’s ideas about the quanti- 
zation of the energy of matter. Quantum theory was later 
refined by Bohr and Einstein. 

It should be stressed that the evolution of quantum con- 
cepts was a tantalizing and often contradictory process, 
and so it calls for a careful historical examination. It is only 
in recent years that it has been given a correct interpreta- 
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tion [3-8]. In the first quarter of the 20th century this evo- 
lution was characterized by two approaches. One of them 
was the development by Bohr of the quantum theory of 
the atom and applications of Bohr’s correspondence prin- 
ciple; using this principle Heisenberg in the middle of 1925 
derived his matrix formalism of quantum mechanics. The 
other approach was associated with the development by 
Einstein (beginning in 1905) of the idea of particle-wave dual- 
ism for radiation and with the application of this idea by 
de Broglie in 1923 to microparticles; in 1926 Schrödinger 
used this approach to work out the wave formalism of 
quantum mechanics—the wave equation named after him. 
Both approaches yielded mathematically equivalent re- 
sults, quantum mechanics was constructed as a. unified theo- 
ry, and its physical interpretation was supplied. In the 
middle of the 1920s the quantum statistics of Bose-Ein- 
stein and Fermi-Dirac were discovered, which were given 
their quantum-mechanical interpretation. Later on, start- 
ing with Dirac’s works of 1927 on radiation theory, the 
fundamentals of quantum electrodynamics and the general 
quantum theory of fields were worked out. 

Significantly, quantum theory, whose beginnings had 
been connected with the refinement of atomistic views in 
the 19th century, was in turn used to account from unified 
physical foundation for the features characteristic of phys- 
ical, chemical, and crystallographic atomistics. This is 
a good example of interplay of sciences. 

In the rest of the article we will be looking at the major 
phases of the history of quantum theory, from its inception 
to its present form.! 


2. Atomistics in the 19th Century 


The atomistics of the 19th century was characterized by 
the fact that it rested on a huge body of experimental evi- 
dence, unlike the atomistics of the 17th and 18th centuries, 
when the atomic views were largely speculative. Increas- 


1 Some of the issues to be discussed below are considered in more 
detail in the author's papers [9-12]. Note that recently the first volumes 
of a multivolumo monograph have been published devoted to the 
history of quantum theory [13]. i 
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ingly more accurate physical measurements enabled research- 
ers in physics and chemistry to establish quantitative 
relationships that were shown, using the corpuscular ap- 
proach, to be determined by the behavior of microparti- 
cles—atoms and molecules consisting of atoms. The corpus- 
cular theory provided for matter in various states of aggre- 
gation clues to its properties—thermal, electrical, magnetic, 
optical—which were studied by various physical methods. 
The regular shape of crystals and their formation were 
attributed to the symmetry of their internal atomistic 
structure. 

It is important to trace the evolution of atomistics in 
three domains—chemistry, physics and crystallography— 
and the emergence of the fundamental problem of the struc- 
ture of the atom as a complicated system governed by spe- 
cial laws. 

Chemical atomistics.1 Concrete atomistic ideas became 
established in chemistry in the 19th century on the basis, 
for one thing, of experimental studies of the composition 
of bodies from weight ratios in chemical transformations 
and, for the other, of probing into the physical behavior of 
gases and into chemical reactions in gases. The initiator of 
chemical atomistics was Dalton, who studied the properties 
of the atmosphere as a mixture of gases and proceeded from 
the atomistic hypothesis to account for diffusion of gases. 
He established in 1801 the law of partial pressures for gas- 
eous mixtures. Dalton viewed solid and liquid bodies as 
consisting of a sea of exceedingly tiny particles, or atoms, of 
matter held together by an attractive force [14]. According 
to Dalton, all the atoms of a substance are identical, and 
those of various substances differ by their relative weights, 
or atomic weights (the term was first introduced by Dalton 
in 1803). Dalton further distinguished the simple atoms of 
substances consisting of one chemical element, and the 
complex atoms of chemical compounds, i.e. he distinguished, 
using modern terminology, atoms and molecules. He 
also examined various combinations of ‘simple atoms’ 
of two elements A and B (A + B = C, binary; A + 2B = D, 
ternary; 2A + B = E, ternary; A + 3B = F, quaternary, 

1 The evolution of chemical atomistics in the wide sense as used 


in this article, i.e. as atomic and molecular theory in chemistry, is 
discussed in [14-17]. 
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etc.) and established theoretically the most important of 
the stoichiometric laws, the law of multiple proportions. 
He later proved it by experimental data on weight ratios, 
specifically with reference to the various oxygen compounds 
of nitrogen (N,0, NO, N,Os, etc.). Dalton regarded ‘simple 
atoms’ of elements absolutely chemically undecomposable 
and mechanically indivisible. 

It is worth noting that for Dalton the chief characteristic 
of an atom was the atomic weight, and so the concept of 
the atom became an entity not only in philosophy but also 
in natural science [14]. He considered, however, that mol- 
ecules (‘complex atoms’) must consist of different ‘simple 
atoms’ and, being guided by Occam’s razor (the principle 
of utmost simplicity), he held that there exist no di- and mul- 
tiatomic molecules of simple bodies (consisting of one ele- 
ment). 

The law of multiple proportions refers to weight ratios. 
In 1808 Gay-Lussac measured the volumes of each constituent 
gas in a chemical reaction and deduced his combining volumes 
law. For example, one litre of oxygen requires precisely 
two litres of hydrogen to form two litres of water vapor. 
In 1811 Avogadro explained this law on the assumption 
(Avogadro’s hypothesis) that a given volume of any gas 
(under the same conditions) contains the same number of 
molecules?, and on the further assumption (‘division hy- 
pothesis’) that molecules of simple gases can divide to 
yield two or more identical particles (‘elementary mole- 
cules’) of the same nature. According to the concepts of atoms 
and molecules, which in Avogadro’s reasoning were not 
yet sufficiently clear-cut [14], the latter assumption suggests 
that molecules of simple gases can consist of two or more 
atoms. In the special case of gaseous hydrogen and oxy- 
gen their molecules (as it was proved later) are diatomic, 
and the reaction producing water vapor is written as 
2H, + 0,— 2H,0. The volume ratio 2:1:2 is here 
correctly explained by Avogadro's hypothesis. 

It was difficult at the time, however, to correlate Dal- 
ton’s law of multiple proportions and Gay-Lussac’s com- 


2 It was not until much later, in the mid-19th century, that Avo- 
gadro’s hypothesis was proved to be valid in the kinctic theory of 
gases, and came to be known as Avogadro’s law for ideal gases (see 
later in the book). 
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bining volumes law using the fundamental concept of 
atoms and molecules as two qualitatively different discrete 
forms of matter [15]. In dealing with weight proportions the 
notions of chemical equivalents and atomic weights were 
confused. Both Dalton and Avogadro in their treatments of 
the volume of gas particles still proceeded from the later 
rejected theory of the caloric, and made different assump- 
tions as to the size of caloric shells surrounding the atoms. 
Convinced that simple gases consist of monoatomic mol- 
ecules, Dalton considered Gay-Lussac’s law wrong. Avo- 
gadro’s hypothesis had long been rejected by the majority 
of chemists. Some of them distinguished the ‘physical’ 
molecule from the ‘chemical’ molecule. It was not until 
several decades later that the situation was clarified, and 
only by the middle of the 19th century took shape a con- 
sistent concept of atoms and molecules in chemistry, which 
was given the clearest definition by Cannizzaro in his works 
and report at the First International Congress of Chemists at 
Karlsruhe in 1860 [16, 18].? Much recognition was gained 
by the view of the molecule as a particle of a complex or 
simple substance, the smallest amount of it that is capable 
of existing on its own and that is the bearer of its chemical 
and physical properties, and of the atom as a particle of a 
chemical element, the smallest amount of it that enters 
into the composition of simple and complex bodies. Along 
with the relative weight (atomic or molecular, i.e. the rela- 
tive mass) atoms of elements and molecules formed from them 
were characterized by aset of chemical and physical prop- 
erties, the evidence about which was accumulating rapidly. 
There appeared the teaching on the valence of atoms and 
on the chemical bonds in atoms and molecules. Valence 
theory was originated by Kekule late in the 1850s, and 
carly in the 1860s. Butlerov put forward his theory of chem- 
ical structure, which rested on the concept of atoms and 
molecules [19]. 

An important development in the history of chemical 
atomistics was the discovery by Mendeleev in 1869 of the 


3 Contributions to atomic theory of Berzelius, Dumas, Godin, 
Laurent, Gérard, Mendeleev (in his early works before his discovery 
of the periodic law) and others and the struggle between the dualistic 
and mitar approaches are considered in [14, Chaps. 6, 7]; [16, Chaps. 3, 
4]; 117, Chap. 2]. 
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periodic law of chemical elements on the basis of the system- 
atization of enormous experimental material on various 
properties of chemical elements and their compounds and 
the generalization of the various specific regularities dis- 
covered earlier. Mendeleev formulated this fundamental 
law in the 1871 edition of Fundamentals of Chemistry: 
The physical and chemical properties of the elements that 
manifest themselves in the properties of simple and com- 
plex bodies stand in periodic dependence ... on their atomic 
weights [20]. When arranged in the increasing order of 
atomic weights in table form—a periodic system—each ele- 
ment will occupy in it a certain place and the elements 
show a periodicity of properties, similar elements recur- 
ring in a definite order. The properties of elements vary in 
a regular manner along horizontal periods and vertical 
groups. It is well known that this enabled Mendeleev to 
achieve a remarkable prediction of the properties of yet 
unknown elements, for which vacant places were left in 
the periodic system and which were discovered at a later 
date. The totality of properties possessed by a given ele- 
ment and the periodicity of these properties came to be 
ascribed to some latent internal properties of this species 
of atoms; and instead of indivisible particles atoms came 
to be regarded as complex systems. 

Mendeleev wrote that ‘atoms of simple bodies are com- 
plex entities formed by combining several of yet smaller 
parts (ultimata), that what we call indivisible (atom) is only 
indivisible by common chemical forces, just as particles* 
are indivisible under normal conditions by physical forces; 
however, although shaky and arbitrary, this supposition 
involuntarily attracts one’s mind when one gets acquainted 
with chemistry’ [20]. He wrote that ‘the periodic varia- 
tion of simple and complex bodies obeys some supreme law, 
whose nature, and especially cause, one has no means of 
grasping yet. In all probability it resides at the heart of 
the internal mechanics of atoms and particles’ [21]. We 
know now that this mechanics appeared to be quantum 
mechanics applied to the atom consisting of the nucleus and 
electrons, and to the molecules formed from the atoms. 
Mendcleev’s predictions came true in the 20th century. 


“ Mendeleev referred to molecules as particles, 
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Among the properties inherent in each species of atoms 
and showing a periodicity, of especial significance for the 
birth of the quantum theory of the atom appeared to be 
the spectral properties. Atoms of each element have a line 
spectrum of their own, with the atomic spectra becoming 
ever more complex as one progresses from hydrogen through 
alkali metals to further groups of elements in the periodic 
table. The most complicated spectra are to be found with 
transition and rare earth elements. It is the specificity of 
atomic spectra that underlies spectral analysis, discovered 
by Kirchhoff and Bunsen [22]. Specifically, it made it pos- 
sible to determine from the spectra of the Sun and some 
stars the chemical composition of these celestial bodies, 
and to show thus that in various parts of the Universe atoms 
are identical in their properties. In 1860-61, using the 
spectral technique, Kirchhoff and Bunsen discovered cesium 
and rubidium [23], and in 1875 Lecoq de Boisbaudran dis- 
covered gallium, the first of the elements predicted by 
Mendeleev (‘ekaaluminium’). As early as 1871 Mendeleev 
wrote about the great value of spectral studies: ‘Spectral 
studies have brought into science not only an insight into 
the composition of remote celestial bodies, but also they 
have supplied a novel technique to study the bodies on the 
Earth’s surface that are at our disposal’ [20]. He predicted 
that ekaaluminium would be discovered precisely by spec- 
tral technique. 

The most important results of the spectral analysis of 
atoms were the discovery of strict patterns in the spectra 
(beginning with the derivation by Balmer in 1885 of a for- 
mula, which is now called after him, for the spectral line 
series of the hydrogen atom [24]); the establishment of 
combination principle on the basis of the results of Rydberg 
[25, 26] and other scientists, and the generalization of the 
observed patterns by Ritz in 1908 [27]. According to this 
principle the wave numbers of observed spectral lines can be 
represented with exceptional accuracy (it is the higher the 
more accurate are the measurements) as the differences of 
some two of a set of spectral terms T,, Ta, Ts, ... that 
characterizes the atom. But this generalization of experi- 


5 More information about Mendelcev’s predictions and his appraisal 
of the role of spectral investigations are given in [9]. 
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mental evidence had remained absolutely unclear up to 
1913, when Bohr used combination principle as an experi- 
mental basis to build his quantum theory of the atom and 
gave a physical interpretation to it by uncovering its quan- 
tum nature. 

The periodic law played an important role in the creation 
of valence teaching. Atoms show up their periodic proper- 
ties quite distinctly through their valence. At the same time 
it was impossible classically to account for valence, just as 
for the cause of periodicity itself. Only proceeding from 
quantum considerations and using the physical theory of 
the periodic system it became possible to explain valence. 
Present-day valence theories are based on the quantum 
mechanics of atoms and molecules.’ 

Physical atomistics. Atomistic concepts underlie the mol- 
ecular kinetic theory of thermal phenomena, and above 
all the kinetic theory of gases.” As a result, there appeared 
statistical physics as the physics of systems composed of 
an enormous number of interacting moving particles, gov- 
erned by stochastic laws (see, e.g. [31l). The major con- 
tributions came in the second half of the 19th century from 
Clausius, Maxwell, Boltzmann, and Gibbs. 

Back in the 18th century the idea that heat is a kind of 
motion of particles in a body was maintained by Daniel 
Bernoulli and Lomonosov. Lomonosov, specifically, ex- 
plained heat as a motion_of ‘insensitive’ particles and al- 
ready distinguished three modes of their motion—transla- 
tional, rotational, and vibrational [32]. But predominant 
then and early in the 19th century (e.g. Dalton and Avo- 
gadro) were views of heat as a kind of weightless substance— 
caloric. It was not until the mid-19th century that the mol- 
ecular kinetic concept of heat gained recognition. First 
strides were being made by the kinetic theory of gases, 
which treated a gas as a sea of elastic particles perpetually 
moving about and colliding with one another chaotically, 
their collisions with the walls of the vessel giving rise to 
the pressure of the gas. Some results were obtained by 
Herapath, Joule, and Krönig [12]. 


1 ff detailed account of the evolution of valence theory is given 
in 128]. 

7 Tho history of the kinetic theory of gases in the 19th century is 
discussod in [29, 30]. 
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Joule was the first to work out the speed of gas particles 
(for hydrogen he got 1,900 m/s under standard pressure and 
room temperature), and Krönig, having compared his 
result: pressure proportional to the atomic mass m of the 
gas® and the square of the velocity of the atoms, v (assuming 
it is the same for all the atoms), with the Clapeyron equation, 
came to the conclusion that the ‘living force of the atom is 
nothing else but the temperature reckoned from absolute 
zero’. The proportionality of the living force of gas particles 
(i.e. the kinetic energy of their translational motion) to 
the absolute temperature was also mentioned by Joule. From 
this he computed the heat capacity of gases, also he still 
ignored the internal energy of particles. 

The modern phase of the evolution of the kinetic theory 
of gases, which relies on statistical methods, was started 
by Clausius [33, 34]. He calculated the pressure of a gas 
and proved Avogadro’s law from the assumption of elastic 
collision of molecules with the same magnitude of velocity 
(equal to the r.m.s. velocity [12]) and equiprobable directions 
of motion (for simplicity, Joule and Krönig in their cal- 
culations supposed that along each of the Cartesian axes 
moves one third of particles). For molecular velocities of 
oxygen, nitrogen and hydrogen under standard conditions 
Clausius obtained the values 461, 492 and 1,844 m/s, re- 
spectively. For molecular collisions, Clausius introduced the 
important concept of the mean collision length l between 
two successive collisions (the mean free path, in modern 
terminology) and used probabilistic considerations to 
derive a formula relating l to the distance between the 
nearest molecules, 4 (when distributed uniformly over the 
volume of the gas?), and to the radius of the sphere of interac- 
tion of a molecule, p (for the elastic-sphere model it is the 
distance between the centres of colliding molecules, i.e. the 
diameter of the spheres). It was by the smallness of the mean 
free path (which under standard conditions is only two 
orders of magnitude larger than A) that Clausius was”able 
successfully to explain the slowness of gaseous diffusion, 
despite the high molecular velocities. 


8 Krönig used the term ‘atom’ for gas particles. 
® 43 is the volume per molecule, and 1/A® is the number of mole- 
cules of the gas per unit volume, 
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It is important that Clausius combined a statistical ap- 
proach with account of molecular properties. In particular, 
he took into consideration the fact that when two mole- 
cules approach each other they are at first attracted and then 
repelled. He thus defined the radius of the sphere of inter- 
action as the impact parameter’ p = p such that at larger 
distances (p œp) molecular paths are only distorted and 
at smaller distances (p < p) molecules rebound from one 
another as elastic bodies. As to molecules Clausius treated 
them (in accordance with the current thinking in chemistry) 
as combinations of atoms and he also supposed that along 
with translational motion a molecule may undergo an 
intrinsic rotational and vibrational motions. He assumed 
that, as a result of the enormous number of collisions of 
molecules, a certain distribution of energy establishes be- 
tween translational and intrinsic modes of motion, which 
manifests itself through heat capacities of gases. It was 
the study of heat capacities of gases that later brought out 
some disagreements with classical theory, which were due 
to the quantization of the vibrational and rotational ener- 
gies of molecules. 

Clausius identified the validity conditions for the laws 
governing the behavior of ideal gases: the smallness of 
the real volume of molecules as compared with that of the en- 
tire gas, the smallness of collision time as compared with 
the time between two successive collisions, and the negligible 
influence of intermolecular forces. Clausius also applied 
the kinetic theory to the various states of aggregation and 
to the evaporation of a liquid and the dynamic equilibrium 
of the saturated vapor with that liquid. 

As a result, Clausius made a sizeable contribution to 
the development of molecular kinetic concepts.!! It should 
be emphasized at the same time that Clausius, while using 
probabilistic ideas, failed to take into account the specific 
features of statistical laws and the significant difference 
between micro- and macroprocesses: he just believed that 
all phenomena were governed by the laws of mechanics 


10 In modern terminology, the impact parameter is the minimal 
distance at which a particle flying past another particle, assumed to 
ho stationary, does not interact with it. 

! For Maxwoll’s appraisal of his contribution see [35]. 
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[12]. Only in later works, by Maxwell and Boltzmann, the 
specific statistical methods different from dynamic ones 
became the basis of investigation into systems consisting 
of an enormous number of particles [36]. Also, Clausius 
ignored the details of molecular collisions and, having 
introduced l and p, he did not determine their numerical 
values; as for heat capacities he just confined himself to 
general considerations. 

The next and most important phase in the history of 
kinetic theory was concerned with Maxwell, primarily with 
his papers [37, 38],!2 and Boltzmann, who summarized 
his results in his famous book Lectures on Gas Theory [39].'? 
At first [37] Maxwell assumed that as a result of molecular 
collisions in a gas there establishes a certain velocity dis- 
tribution of molecules, and he found the corresponding 
distribution function, which is now known as the Maxwell dis- 
tribution. To derive it he supposed that the distributions of 
molecular velocities over the Cartesian components are 
independent. Later [38] he gave for the velocity distribution 
another derivation of fundamental importance, which relied 
on the compensation under equilibrium of any collisions 
of each kind by reverse processes of the same kind, i.e. on 
the principle of detailed balancing. These arguments were 
expounded in much detail by Boltzmann, who also first 
derived a spatial distribution for molecules (the Boltzmann 
distribution) and generalized the theory to multiatomic 
gases. As a result, the general form of the energy distribu- 
tion of gas molecules—the Maxwell-Boltzmann distribution— 
was obtained, according to which the probability of a 
molecular state with energy E varies as exp (—E/kT), 
where T is the absolute temperature, and k the Boltzmann 
constant. The energy E was assumed, in accordance with 
classical laws, to be a continuous function of coordinates 
and velocities (or momenta) describing the motion of a 
molecule (including its intrinsic degrees of freedom). From 
this followed the law of equipartition of energy (which for 
the kinetic energies of translational and rotational mole- 


12 A more detailed analysis of these and other works by Maxwell 
on molecular and statistical physics is given in [42]. 

13 About Boltzmann's works see, for example, [29-31]. Also see 
[40], ‘Translator’s Introduction’ by Brush to [39]. 
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cular motions was obtained by Maxwell [37]), and, as a 
consequence, the proportionality of heat capacity to the 
number of degrees of freedom (with factor 2 for vibrational 
degrees of freedom), and the independence of heat capacity 
of temperature. The theory of heat capacity was in sharp 
contrast with experiment, which was characterized by 
Thomson (Lord Kelvin) at the turn of this century as one 
of the ‘nineteenth century clouds over the dynamical theory 
of heat and light’ [41], and had to await the coming of quan- 
tum theory.‘ 

Maxwell [37] used the elastic-sphere model for molecules 
to consider molecular collisions’* and, using Clausius’s 
concept of mean free path J, he advanced a theory of trans- 
port phenomena: internal friction, diffusion and heat 
conduction. This enabled him for the first time to estimate 
(using the values of the coefficients of internal friction and 
diffusion) 7 and the number of collisions of a molecule per 
unit time. And Loschmidt in 1865 [42] first estimated mo- 
lecular diameters p for atmospheric gases from the rela- 
tionship between l/p and volume ratio for a gas and a li- 
quid (from this one can also estimate the Loschmidt number 
L). Molecular diameters were then estimated by other 
scientists as well and they appeared to be of the order of 
angstroms. In 1873 Maxwell [43] used mutual diffusion 
coefficients determined by Loschmidt for a number of 
gases to obtain refined values of l, and from these the dia- 
meters of molecules H,, O,, CO, and CO (5.7, 7.6, 8.3 and 


9.3 A, respectively) and L = 1.9 x 101° cm~ (instead of 
2.9 x 101° cm-®). The kinetic theory of gases made it thus 
possible to find the correct order of molecular sizes (and 
also to estimate the number of gas molecules per unit vol- 
ume, and hence the Avogadro number N,, whose value 
was later, early in the 20th century, found by Perrin by 
a more direct method from the height distribution of sus- 
pended particles in a liquid). It turned out, however, that 
the question of what dictates the size of atoms and mole- 


14 For binary collisions, he applied geometrical diagrams for veloc- 
ities, which satisfied energy and momentum conservation laws and 
were also valid for particles colliding elastically with interaction 
forces dependent arbitrarily on the centre-to-centre distance. This 
ee was later used by Boltzmann and others and became com- 
mon 12). 
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cules could only be answered by the quantum theory of 
atomic and molecular structure. 

Statistical methods in physical atomistics benefited enor- 
mously from analysis of processes responsible for the estab- 
lishing of an equilibrium. Maxwell suggested a phenome- 
nological description of this process using the notion of 
relaxation time (for viscosity) [38, 12]. In his funda- 
menta! work [38] he also developed the theory of transport 
phenomena not in terms of mean free path’, but of the 
statistics of the time variation, due to collisions, of a cer. 
tain function Q of molecular velocity components (a linear 
function for diffusion, a quadratic function for gaseous pres- 
sure and internal friction, a cubic function for heat con- 
duction). Maxwell averaged the derivative with respect 
to time 0Q/dt over various types of collisions by integrating 
over the impact parameters (so appeared the ‘collision 
integral’).%¢ 

It was of great importance that Boltzmann applied the 
general method, used by Maxwell to find 0Q/dt, to the ve- 
locity distribution of molecules, f Soltzmann thus arrived 
at his famous kinetic equation—the Boltzmann equation 
for 0f/Ot—and at the time variation of the H function he 
introduced (the H-theorem). He showed that the H function 
is proportional to In W (W is the state probability for a 
gas viewed as a system of particles), and that as an equilib- 
rium is achieved H tends to zero and that —H only differs 
from the entropy of the gas, S, by a constant factor and 
a term. Boltzmann thus derived a law relating entropy 
to probability, which could be written (which Planck did 
later) as S = k ln W + So, where S, is a constant. The 
growth of entropy and its tendency to a maximum as an 
equilibrium is reached were accounted for by an appropriate 
growth of the state probability for a system of particles. 


15 This approach to the theory of heat conduction, described by 
Maxwell in [37], was criticized by Clausius [45]. Maxwell agreed with 
the criticism and so appeared his further treatment of transport phe- 
nomena [46]. 

16 Maxwell assumed [38] that molecules repel one another with 
a force dependent on the distance r between their centres, but for 
specific computations he used a simplified version of the formulas, 
based on the assumption that the repulsion force is inversely pro- 
portional to r® (the case of ‘Maxwellian molecules’). This simplification 
delighted Boltzmann [47]. 
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Thus the second law of thermodynamics was shown to be 
a statistical law. The statistical nature of the second law 
of thermodynamics was first shown by Maxwell (see e.g. 
[12]), but it was Boltzmann who found the law relating the 
entropy of a gas to its state probability. In the most general 
form for any system of interacting particles the relationship 
between entropy and probability was studied by Gibbs 
in his statistical mechanics [48], which can be viewed as 
the conclusion of the evolution of physical atomistics at 
the molecular level in the 19th century. 

It is worth noting that early in the development of quan- 
tum theory by Planck and Einstein the application of Boltz- 
mann’s atomistic entropy-probability relation to thermal 
radiation (see below) played a significant role. 

Crystallographic atomistics. Atomistics was the key con- 
cept of the teaching—on crystals. Crystallography, closely 
related both with mineralogy and with physics and chem- 
istry, developed in two directions." On the one hand, the 
science dealt with the symmetry of the external shape of 
crystals, their formation and growth, their various prop- 
erties; it established various patterns and laws drawing 
on a body of experimental data. On the other hand, along 
with this empiric direction there were theoretical studies 
of the internal structure of crystals on the basis of assumptions 
concerning the regular arrangement of particles that form 
the crystal, i.e. on atomistic concept. 

The idea that the symmetry of crystals is concerned with 
regular arrangement of appropriately shaped particles in 
space has been held by some scholars ever since the 16th 
century. So Cardano, reflecting on the hexagonal shape 
of rock crystal, suggested that the internal structure of 
crystals is dictated by the close packing of spheres. In 1611 
in his remarkable concise treatise On Hexagonal Snow Flakes 
[54] Kepler considered various forms of the packing of spheres. 
Another proponent of this hypothesis was Hooke. But 
Huygens assumed that the regular form of Iceland spar 
is due to the ellipsoid shape of the constituent particles, 
whereas Guglielmini thought of crystals as consisting of 


17 The history of crystallography is a subject of fundamental 
monographs by Shafranovsky [49, 50]. They discuss the role of atom- 
jstic concepts in much detail, and contain extensive bibliography. 
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tiny particles of the same form. A major contribution to 
the atomistic theory of crystalline structure was made by 
Lomonosov, who explained in particular the crystallo- 
graphic law of constancy of angles in terms of packed 
spheres [49]. 

The modern teaching on the internal structure of crystals 
—structural crystallography—was originated by Haiiy. 
He treated crystals as consisting of tiny polyhedra—‘con- 
stituent (or integrating) molecules’. In 1783 in his book 
Experience of the Theory of Crystalline Structure and Its 
Application to Heterogeneous Crystalline Substances Haüy 
sought to ‘find for a given crystal an exact shape of its 
constituent molecules, their mutual arrangement and the 
laws that govern the variations of its constituent layers’ 
[52]. Haüy summarized his main results in the Course of 
Crystallography published in 1822, where he gave the fol- 
lowing definition of crystals: ‘Molecules, being attached 
to one another by the surfaces most suitable for this attach- 
ment, produce in combination regular bodies bounded by 
flat faces and analogous to geometrical figures. Such sub- 
stances are in general called crystals, whatever the substance 
from which they are made up’ [52]. 

Haiiy considered three types of ‘integrating molecules’: 
tetrahedron, trihedral prism, and parallelepiped. Both 
from tetrahedra and trihedral prisms one can make up paral- 
lelepipeds and so Haüy came to the important conclusion 
that all the crystals can be made up of parallelepipeds, as 
of tiny molecular building blocks. This was the beginning 
of the theory of lattice structure of crystals. This theory 
was later developed by Bravais. In 1850 in his Memoir of 
a System of Points Arranged Regularly in a Plane or in 
Space he listed 14 types of spatial lattices (later called the 
Bravais lattices) [50], which were based on the translational 
symmetry of crystals. In addition, some scientists, starting 
from Hessel (1830), gave their classification of crystals 
from the symmetry of ‘end figures’—crystalline polyhedra 
(in modern parlance, from definite point groups of symmetry). 
It was shown that in crystals, 32 kinds of symmetry are 
possible which correspond to point groups with symmetry 
axes of the first, second, third, fourth, and sixth orders in 
some combinations with a symmetry centre and symmetry 
planes [50]. This formed the subject of the 1867 work by 
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Gadolin A Derivation of All the Crystalline Systems and 
Their Versions from a Common Origin. 

A complete solution of the problem of symmetry of the 
internal structure of crystals from geometrical point of 
view was given by Fedorov [50]. In his 1885 book Funda- 
mentals of the Teaching on Figures Fedorov considered the 
filling of space by polyhedra and singled out five principal 
types of polyhedra (‘parallelohedra’), he also formulated 
the concept of a regular system of figures and a regular 
system of points. In 1890 he published his classical treatise 
The Symmetry of Regular Systems of Figures, in which he 
gave a complete derivation of 230 spatial groups of sym- 
metry possible for crystals (Fedorov groups). He wrote: 
‘A complete derivation is given here for all regular systems 
of points and the first steps are made to identify all pos- 
sible kinds of crystalline structure’. An independent iden- 
tification of 230 spatial groups was given in 1891 by Shoenflies 
in his book Crystalline Systems and Structures. This work 
was based on group theory. 

So on the basis of the ideas of discreteness of matter and 
its atomistic nature the theory of the symmetry of the in- 
ternal structure of crystals was worked out. But in the 
19th century the questions of what particles make up crys- 
tals and how these particles are arranged in a crystalline 
unit cell could not yet be answered. The answers came in 
1912 when Laue discovered the diffraction of X-rays and 
applied to crystals X-ray diffraction analysis. Fedorov im- 
mediately responded to Laue’s discovery: ‘For us, crystal- 
lographers, this is a discovery of first-class importance, 
because now for the first time and with full clarity is re- 
produced what we laid only theoretically at the founda- 
tion of our views of crystalline structure, which by the way 
is the backbone of crystal analysis’ [50]. 

Symmetry is most important to modern quantum theory. 
It was in crystallography that spatial symmetry of systems 
having atomistic structure was studied for the first time. 
Atomistics in crystallography became one of the sources of 
application of symmetry techniques in physics. Prompted 
by the studies of crystal symmetry Pierre Curie laid down 
as early as the 1880s and 1890s ideas of the fundamental 
value of the notion of symmetry of physical phenomena 
and formulated the universal principle of symmetry-dis- 
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syminetry [53]. Group-theoretical methods that originated 
in crystallography now became quite common in the quan- 
tum theory of solids and molecules. 


3. The Atom as a Complex System. 
The Rutherford Model 


The evolution of atomistics in the 19th century led to 
ever more specific formulation of problems concerned with 
the structure of matter at the atomic and molecular level—the 
atomic structure of molecules, the structure of crystals and, 
especially important, the structure of atoms themselves as 
complex systems. These problems were concerned with the 
issue of the nature of forces that determine the structure of 
matter at this microscopic level, which, it appeared, was 
characterized by space scales of the order of 10-8 cm, i.e. 
of the order of the size of atoms and simple molecules, of 
intermolecular distances in a liquid, of the sizes of unit 
cells in crystals. Gravitational forces, very small at these 
distances, cannot account for microscopic phenomena. 
Experimental and theoretical studies of electrical and mag- 
netic phenomena, started on a large scale since the beginning 
of the 19th century, resulted in discoveries of a wide variety 
of properties of matter at the atomic and molecular level’, 
which in turn warranted the fundamental conclusion that 
the interaction forces, responsible for the structure of matter 
at this level, are electromagnetic in nature. 

As early as the first half of the 19th century it was found 
that there exists an interconnection between chemical 
and electrical phenomena: the conversion of chemical ener- 
gy into electric one in the galvanic cell, the chemical effects 
of a current in electrolytes (electrolysis). 

According to electrochemical theory, due to Berzelius 
[56] and called dualistic, chemical affinity was explained 
by the electrostatic interaction of two parts of a molecule. 
These parts were thought to carry unlike charges and hence 
to attract each other. Although this theory could not ex- 
plain all the complex aspects of molecular structure and 


1 The history of physics of the 19th century is highlighted in 
many monographs, e.g. [54] and [55]. 


30 M. A. Elyashevich 


turned out to be unsuitable for organic compounds (for 
which other scientists proposed a unitary theory [14]), the 
very idea of accounting for the formation of’ molecules by 
an electrical interaction held true. Only much later, in the 
1860s, de Lavaud, in connection with the theory of valence 
and chemical bonding, explained the formation" of mole- 
cules by electric and magnetic interaction forces [28]. He 
dealt with electrostatic interactions of atoms_in molecules 
and also put forward the hypothesis that atoms consist of 
‘subatoms’, which are carriers of affinity units, and which 
can form pairs by magnetic interactions, just like magnets. 
De Lavaud thus gave a concrete picture of the complex 
structure of the atom. 

It should be emphasized, however, that the idea that 
molecules consist of atoms because of the electromagnetic 
nature of interaction forces did not find support among the 
majority of the chemical community, including the prom- 
inent chemist Mendeleev; chemical atomistics developed 
on independently of these concepts and the nature of intra- 
molecular forces was not disclosed. Nor was the nature of 
intermolecular interaction forces revealed in molecular 
kinetic theory. So Maxwell, Boltzmann and others regarded 
molecules as force centres with the repulsion forces between 
molecules being functions of the centre-to-centre distance. 
Crystallographic atomistics did not concern itself with the 
nature of interaction forces and the properties of the partic- 
les that form the crystal. The symmetry of the internal 
structure of crystals was only treated in geometrical terms. 

Of great importance for the discovery of electromagnetic 
nature of interaction forces at the atomic and molecular 
level was the discovery of elementary electric charges car- 
ried by particles of matter, and studies of electric discharges 
in gases (i.e. plasma, to use modern terminology), which 
resulted in the discovery of the first elementary particle, 
the electron. 

The hypothesis that particles of matter have associated 
with them discrete electric charges, positive or negative, 
was based on the laws of electrolysis established by Fara- 
day in 1832-33. He found that as a certain amount of elec- 
tricity is passed through electrolyte a certain amount of 
matter is deposited on electrodes, and that the mass of the 
matter is proportional to the chemical equivalents. Atom- 
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istically, this is due to the fact that each cation or anion 
that neutralizes at electrodes carries a definite electric 
charge equal to an elementary electric charge e (for univalent 
atoms) or its multiples. This conclusion concerning the 
amount of electric charge connected with particles of matter 
was clear even to Faraday, who wrote that if one accepts 
the atomic theory, then atoms, which are equivalent in 
terms of their chemical action, contain equal amounts of 
electricity, which is associated with them in a natural 
manner. 

But he did not expand on this idea since, as he put it, 
although it is very easy to speak about atoms, it is quite 
difficult to form a clear understanding of their nature [57]. 
Later, in 1873, Maxwell wrote about the consequences of 
Faraday’s laws that cations and anions have constant 
charges that should be called the molecular charge. If it were 
known it would become the most natural unit in electricity 
[58]. But Maxwell also referred to the convenience of this 
definition (which was at odds with the theory of continuous 
field) for investigation into electrolysis. 

The most clear-cut comments on the fundamental value 
of the existence of elementary electric charge were made 
independently by Stoney and Helmholtz. In his report ‘On 
the Physical Units of Nature’ made in 1874 and published 
in 1881 [59] Stoney suggested for the role of three natural 
units the ratio of electrostatic units to electromagnetic 
ones (i.e. the velocity of light c), the gravitational constant 
g and the elementary electric charge e, found from elec- 
trolytic effects. Stoney wrote that nature gives usin elec- 
trolysis a certain unit of electricity, which is independent 
of specific substances. And Helmholtz in 1881 at Faraday 
Lecture spoke about the application of the atomistic hy- 
pothesis to electric phenomena. He maintained that, if 
applied to electric processes, this hypothesis, in combina- 
tion with Faraday’s law, would lead to stunning conse- 
quences. And from the assumption that there exist chemical 
atoms follows that electricity, positive and negative, also 
breaks down into certain elementary amounts that appear 
as atoms of electricity [60]. 

It seemed natural at the time to apply the concept of 
travelling particles having definite electric charges not 
only to the electric current in electrolytes but also in met- 
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als. But the nature and properties of current carriers in 
metals (they are now known to be electrons) remained 
unclear. There were no techniques for probing into the 
motion of charged particles within metals, just as within 
electrolytes. New possibilities were offered by the studies of 
electric discharges in gases, when current carriers are charged 
particles moving independently of one another in an orderly 
manner. As they do so they manifest their corpuscular prop- 
erties most distinctly and they can be observed to be in- 
fluenced by external magnetic and electric fields. During 
the studies of gas discharges cathode rays were discovered, 
but a number of studies were required to establish their cor- 
puscular nature as negatively charged particles. It was of 
great significance that to obtain electric discharges in gases 
researchers were now using voltages of thousands and tens 
of thousands of volts (this became possible by the advances 
made by high-voltage technology), and so charged particles 
could be accelerated up to energies (in modern units) of 
thousands and tens of thousands of electronvolts. Elementary 
processes were now investigated that had energies several 
orders of magnitude more than the energy of elementary 
chemical processes (electronvolts) and average thermal 
energies of particles of matter (of the order of kT, i.e. hun- 
dredths of electronvolt under normal temperatures and of 
electronvolts only at temperatures of tens of thousands of 
degrees). 

Electric discharge studies paved the way for the discovery 
of X-rays in 1895 and the electron—the first elementary 
particle—in 1897. One indirect result was the discovery 
in 1896 of radioactivity, which signalled the beginning of 
probing into elementary processes with energies of the 
order of megaelectronvolts, several orders more than the 
energies obtained at the time in gas discharges. 

These great discoveries of the last decade of the 19th cen- 
tury formed the foundation for solving later on the problem 
of the atomic structure as a complicated system consisting 
of interacting charged particles, the nucleus and electrons. 

It was found that electrons are unquestionably available 
in all atoms. This was evidenced by their emergence during 
gas discharges as cathode rays during thermionic emission 
(discovered by Edison as early as 1884), during photoelec- 
tric effect (discovered by Hertz in 1887 and studied exten- 
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sively by Stoletov and other scientists). Supporting evi- 
dence came from measurements of the ratio of charge e to 
mass m for charged particles, which (as Lorentz proposed 
in 1897 according to his classical electron theory) undergo 
simple harmonic oscillations within the atom to account 
for the splitting of spectral lines of atoms in a magnetic 
field discovered by Zeeman in 1896 and named after him. 
The value of e/m computed by Lorentz from Zeeman-effect 
data coincided with the results for the electron obtained 
from cathode-ray experiments, and also for the negative 
charges produced in thermionic emission and photoelectric 
effect. Later, in 1911, Barkla could, basing himself on the 
theory of X-ray scattering by atomic electrons, due to 
J. J. Thomson, find that for light atoms the number of 
electron scatterers per atom is about A/2, where A is the 
atomic weight. 

Early in the 20th century a number of models were sug- 
gested for the atom containing electrons. According to the 
planetary models of Perrin (1901) and Nagaoka (1904) the 
atom consists of the central positive charge surrounded 
by electrons revolving around it (in Nagaoka’s model they 
form rings like the Saturn rings). In 1903 Lenard worked 
out a model in which the atom consisted of ‘dynamides’ — 
pairs of positive and negative charges taking up only a 
small share of the atom’s volume, thus accounting for the 
transparency of the atom for fast electrons. But these models 
were not refined and computed. A more specific model was 
developed by J. J. Thomson [61, 62]. According to it the 
atom consists of a positive ball-shaped charge distributed 
over a volume, in which negatively charged electrons oc- 
cupy some equilibrium places, being arranged in the plane 
on the rings around the centre of the ball. Thomson assumed 
that the periodicity of the properties of the chemical ele- 
ments is due to new external rings added as the number of 
electrons in the atoms increases with its atomic weight. La- 
ter Bohr wrote about the significance of Thomson’s model: 
‘Ever since the famous attempt of J. J. Thomson to inter- 
pret the periodic system on the basis of stability of the 
various electron configurations the idea that electrons are 
broken down into groups became the starting point of later 
concepts as well. Thomson’s supposition that the positive 
charge is distributed in the atom turned out to be inconsis- 
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tent with the experimental evidence obtained from studies 
of radioactive substances: Nevertheless this work contains 
many ingenious ideas and has had its effect on the further 
development of atomic theory’ [63]: 

Rutherford with coworkers proved experimentally that 
Thomson’s model was incorrect. Their experiments turned 
dut to be decisive for clarifying the atomic structure. They 
carried out experiments on scattering of alpha-particles emit- 
ted by radioactive substances when their beams are passed 
through thin metal foils. Unlike the earlier models of the 
atom, the Rutherford model, based on the results of these 
experiments and presenting the atom as consisting of the 
central nucleus and electrons that surround it—the Ruther- 
ford atom—was borne out experimentally and exceedingly 
convincingly at that. Later on it was receiving much sup- 
porting evidence from various sources. With good reason 
Rutherford can be considered the discoverer of the atomic 
nucleus. 

In experiments with beams of alpha-particles (which are, 
as Rutherford had shown previously, doubly charged pos- 
itive helium atoms ejected in a radioactive decay of atoms 
at about 10° cm/s, which corresponds to kinetic energies 
of the order of megaelectronvolts) early in 1909 were found 
deviations from the initial direction of a small fraction 
(about 10-*) of the alpha-particles by angles more than 90°. 
This was at variance with Thomson’s model, which assumed 
that as a result of multiple (‘complex’) scattering at angles 
of about 0.01° the average deflection? will be about 1°, and 
the probability of occurrence of large deflections will be 
negligible. From the large deflections observed Rutherford 
concluded that an atom contains an intense electric field, 
since otherwise it would be impossible for a particle to be 
deflected within such a tiny distance as the diameter of a 
molecule [64]. 

It was when he was refining this idea that Rutherford put 
forward in 1911 his nuclear atomic model [65, 66]. Sig- 
nificantly, he explained the observed large deflection angles @ 


2 If a single atom contributes on average to the total deflection of 
an alpha-particle an angle g, then the average deflection by n atoms 


will be Vno and at n ~ 10% (as in Rutherford’ s experiments with 
thin metal foils) and ọ = 0.01° we obtain V ng x 1°. 
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(from tens of degrees to 180°) by single scattering when alpha- 
particles with impact parameters p of about 10-1! cm and 
even less approach the nucleus to distances of the same or- 
der of magnitude® at which their interaction with the nu- 
cleus becomes very strong. Using classical mechanics and 
probabilistic considerations, Rutherford derived a formula 
for the scattering of alpha-particles (the Rutherford scatter- 
ing law) assuming a Coulomb repulsion between an alpha- 
particle and a nucleus and using energy and momentum 
conservation laws. The formula is known to be in agree- 
ment with experiment and made it possible to estimate 
the nuclear charge Z (in e units) to be about A/2 [66]. It 
is worth noting that when the Rutherford scattering law 
was supported experimentally it became absolutely clear 
that the atom is a system of interacting charged particles 
consisting of the nucleus of large mass and positive charge? 
and electrons of small mass and negative charge, the positive 
and negative charges being equal in the atom. It is quite 
significant that to repel an alpha-particle by a nucleus the 
Coulomb law turned out to be only valid up to distances of 
the order of 10-1? cm. Since Rutherford, in deriving his 
formula, assumed the nucleus to be a point, this implied 
that the nucleus is about 10-1? cm across. 

The most important and direct consequence of the Ruther- 
ford model was the possibility to divide an atom’s properties 
into two types: those determined by the nucleus and those 
determined by the electron shells [67]. The first person to 
understand and accept this consequence was Bohr and it 
was clear he had done so by 1912 when he was working at 
Rutherford’s laboratory. 

The properties of the first type include the nuclear charge, 
the atomic mass (which differs but slightly from the nucleus 


3 An alpha partide comes the closest to a positively charged and 
repelling it nucleus at p = 0, corresponding to @ = 180°, and accord- 
ing to Rutherford’s estimate for heavy atoms the minimum value b 
of the distance between an alpha-particle and a nucleus is 3.4 X 
10-12 cm. At p = b/2, 9= 90°; at p= b, ọ = 58°; at p = 2b, 

= 28°; and at p = 5b, ọ = 11.4°. 

4 Note that in [66] Rutherford was still very cautious and so he 
admitted a possibility of a negative charge at the nucleus (the Ruther- 
ford scattering law contains Z? and is thus independent of the sign 
of the charge). It was not until later that he made his final conclusion 
that the nucleus is positively charged. 
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mass because the electron mass is small), and the radioac- 
tive properties (in addition to nuclear moments—mechanical, 
magnetic, and electric—discovered later). The properties 
of the second type, due to the electron shells in the atom, 
are the size of the atom as a whole, the chemical and most 
physical properties, including electric, magnetic, and op- 
tical. 

Among the properties of the first type the most important 
one is the positive charge Z, which determines the number 
of electrons held by the nucleus in the neutral atom, and 
hence the properties of the electron shells, i.e. the proper- 
ties of the second type. 

The periodic law, i.e. the periodicity of second-type prop- 
erties, was later on explained by applying quantum theory 
to the structure of electron shells. It was of especial signifi- 
cance to identify the value of Z—the atomic number—with 
the serial number of an element in the periodic table, which 
was first, and especially clearly, formulated by Van den 
Broek and which should be regarded as an independent 
discovery.® 

This discovery rested directly on the Rutherford model, 
just like the explanation of radioactive displacement 
law, discovered independently by Russel, Soddy and Fajans, 
which related first-type properties (radioactive decay of 
the nucleus) to second-type properties (chemical properties 
of an element) [70, 71]. 

It is to be stressed that the above division of atomic 
properties corresponds to the difference in the order of 
magnitude between nuclear phenomena (typical distances 
10-13 cm and energies of the order of megaelectronvolts) 
and atomic and molecular phenomena (10-8 cm and electron- 
volts). This then brought about the division of the physics 
of the microworld into atomic® and nuclear physics, with 
their differences determined by different space and energy 
scales and the nature of interaction forces. 


5 About Van den Broek and his discovery see [68, 69]. 

® In the narrow sense of the term, in current use nowadays. For 
a long time atomic physics covered the entire physics of microscopic 
phenomena. Nuclear physics too is at present divided into nuclear 
physics proper and the physics of elementary particles and high- 
enorgy physics (energies of the order of gigaelectronvolts and 
more). 
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In 1911-13 and nearest subsequent years, i.e. directly 
after Rutherford had formulated his atomic model, the sec- 
ond-type properties were central problems in atomic phys- 
ics. Attempts to explain these properties proceeding from 
electromagnetic nature of interaction forces, which determ- 
ine the atomic structure, immediately run into great difficul- 
ties. They consisted in that, according to basic concepts of clas- 
sical physics, on which rested all the previous theory of the 
atom as a complex system, the atom made up of a positive 
nucleus and negative electrons must be unstable. 

Atomic electrons, if they moved with an acceleration in 
orbits around a nucleus being attracted to it by Coulomb 
forces, according to classical physics would continuously 
lose energy by radiation. As a result they would approach 
the nucleus and eventually collapse onto it, and the radia- 
tion produced would have acontinuous spectrum. Classically, 
gas atoms or molecules would also be mechanically unstable 
under collisions. Collisions would affect the motion of atom- 
ic electrons, and hence the frequencies of the emitted and 
absorbed light. Accordingly, the observed stability of atoms, 
the discreteness and invariability of the spectra could not 
be accounted for by the laws of classical physics. 

Rutherford understood that it would be difficult to ex- 
plain the stability of his nuclear model of the atom. He 
was however sure that the model was correct, and he just 
wrote that at the time it was too premature to turn to the 
issue of stability of the atom suggested, since stability was 
dependent on the details of atomic structure and the motion 
of the atomic charged particles [66]. 

It only became possible to prove the stability of the plan- 
etary atom by abandoning for this purpose the classical 
concepts and applying instead novel ideas of quantum theory, 
which had been gaining recognition since the turn of this 
century due to the contributions of Planck, Finstein and 
other scientists. This step was made by Bohr in 1913 and 
his atomic theory will be considered later in the article, 
after a treatment of the origin and early days of quantum 
theory, 
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4. Origin and Early Days of Quantum 
Theory 


Quantum concepts were first introduced by Planck in 1900 
[72] who at the time was dealing with thermal radiation at 
equilibrium with matter—equilibrium radiation. Planck 
made his fundamental assumption concerning quantization 
of energy of matter (of ‘resonators’ that model the matter) 
and he succeeded in deriving the experimentally supported 
statistical law 
8hv3 1 4 
Uy, T= — exp(hv/kT)—1 ’ (1) 


which defines the spectral energy density u,,r for equilib- 
rium radiation (per unit volume filled with the radiation 
and per unit frequency interval v) as a function of absolute 
temperature. In Planck’s radiation law (formula) (1) k is 
Planck’s constant (quantum of action, as Planck called it 
later [73]) used in the physics of the microworld—quan- 
tum physics. 

Planck’s law was the result of a long path of the devel- 
opment of the theory of equilibrium radiation, closely 
linked with the studies of emissive and absorptive powers 
of bodies in ever greater range of temperatures and frequen- 
cies and with ever greater accuracy. The value and physical 
meaning of this law were comprehended at a later stage of 
the development of the theory, which was also a fairly 
complex period, but now on the basis of quantum ideas. 
This law turned out to be associated with the discreteness of 
the elementary events of emission and absorption of radiation 
by matter and, especially significant, with quantization of 
the energy of the radiation itself, which had been revealed 
by Einstein in his works of 1905 and later years (see below). 
When in the mid-1920s the Bose-Einstein statistics was dis- 
covered, it appeared that Planck’s law was a special case of 
that {statistics. 

It is of interest to trace down how the theory of equilib- 
rium radiation evolved and its laws established, how the 
quantum nature of these laws was revealed and quantum 
concepts made their appearance, how the transition occurred 
from statistical consideration of equilibrium radiation to 
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the quantum approach to elementary processes of interaction 
of radiation with matter and to the properties of the matter 
itself, its particles—atoms and molecules. Early in the 
development of quantum theory, before Bohr developed his 
theory of the atom, the major contributors to the theory 
(from the 1890s to 1913) were Planck and Einstein. It is 
important to understand the close ties linking the work of 
these scientists together and to the earlier work of Wien 
to appreciate correctly the contributions of Planck, Einstein 
and other scientists to quantum theory in the early days 
of its development. 

We now proceed to consider the most interesting land- 
marks in the history of emergence and initial growth of 
quantum theory. 

Equilibrium radiation theory before Planck and Einstein. 
The foundations of the theory were laid by Kirchhoff in 
1859-60 in his classical works [74, 75].2 He considered for 
bodies in complete thermodynamic equilibrium at a given 
temperature mutual compensation of absorption and emis- 
sion of any monochromatic radiation and proved theoret- 
ically that ‘the ratio of absorptive to emissive power ... 
for all bodies at the same temperature is the same’ [75]. 
Kirchhoff started from the concept of absolutely black bodies 
introduced by him, ‘which at an infinitesimal thickness 
totally absorb all the incident rays, i.e. neither reflect 
nor pass them’ [75]. Kirchhoff’s law; which relates the 
absorptive and emissive powers of a body, can in modern 
symbols and in terms of frequency® be written as 


By, 7/Ay, r= BY r, (2) 


where B, r is the emissive power of the body in a given di- 
rection, which is defined as the radiation flux in this direc- 


1 A more detailed treatment of a number of issues and an extensive 
bibliography are available in [3] in a chapter on the formation of 
quantum theory. A detailed analysis of the early history of the theory 
of equilibrium radiation is given in [8], which also contains some 
works by the originators of the theory. The prehistory of Planck’s law 
is the subject of [5]. The developments up to 1913 are considered in 
[4, 7]. See also [6] and [9-11]. 

2 Jlistorically, these works were connected with the discovery by 
Kirchhoff and Bunsen of spectral analysis (see above, and also [3, 54]). 

3 Kirchhoff himself wrote his law in terms of wavelengths 2.. In 
what follows we will use v = c/A and other conventions, 


40 M. A. Elyashevich 


tion from a unit surface (the direction is taken care of by 
cos 8, where 8 is the angle with the normal to the surface) 
per unit solid angle per unit frequency range; A, 7 is the 
absorptive power of the body (the ratio of the energy ab- 
sorbed to the energy incident on the body in the opposite 
direction), and By’; is the corresponding emissive power of 
an absolutely black body, which is the same in all directions 
and is related to the spectral energy density u, r of equi- 
librium radiation by BY’; = (c/An) u,,r. 

It should be emphasized that Kirchhoff’s law (2) is based 
on the assumption that there exists complete thermodynamic 
equilibrium of radiation with matter and it only holds 
rigorously for thermal radiation equilibrium.’ The task of 
finding for equilibrium radiation the universal function 
Br of frequency and temperature (and hence the function 
u,,r Which characterizes equilibrium radiation filling a 
certain volume, e.g. a cavity with walls at T or a volume 
of gas at the same temperature when the mean free path for 
the radiation is shorter than the size of the volume) was 
thought of by Kirchhoff to be ‘a problem of immense im- 
portance’ [75].° This problem of finding the spectral distri- 
bution for equilibrium radiation at a given temperature 
was solved by Planck, who derived the law (1). And 
knowing that quantum law and further developments in 
quantum theory, we can now gain a correct understanding 
of later works whose authors, after Kirchhoff, sought to 
establish laws of equilibrium radiation. 

At first came the Stefan-Boltzmann law, which states 
that the integral energy density of equilibrium radiation 


up = Í uy rdv is proportional to T4: 


° ur = al, (8) 


where a is a constant. This law was first obtained by Stefan 
in 1879 [76] who processed experimental data on integral 


4 It holds approximately for nonequilibrium, when thermal radi- 
ation is emitted by the surface of a body heated to a high temperature 
into a colder medium with its own radiation small in comparison with 
the emission of the body. 

5 Kirchhoff considered the universal function J of wavelength 
and temperature, from which it is easy to go over to equivalent uni- 


yersal functions in another form, specifically to Bir Or un,T- 
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emissive power of bodies at various temperatures, and 
then, in 1884, Boltzmann derived it theoretically [77]. 
He applied to equilibrium radiation the relation Tdp — 
pdT = urdT, p being the radiation pressure, based on the 
second law of thermodynamics. This implied introduction 
in implicit form for equilibrium radiation of entropy S 
(see [8]). To determine the pressure for equilibrium 
radiation Boltzmann made use of the relationship p = u,/3, 
which follows from Maxwell’s electromagnetic theory of 
light with account of the isotropy of that radiation. Bo!tz- 
mann thus drew on thermodynamic and classical electro- 
dynamic considerations. It is significant, however, that 
the relation p = ur/3 also holds in quantum theory (where 
it is derived in a straightforward manner from the concept 
of the photon momentum hv/c). And so (3) holds. Moreover, 
only in quantum theory a is finite and can be computed. In 
fact, integrating (1) with respect to v gives 
up = f Uy, r dv = (8nkt/h3c3) (14/15) T4 = aT4, 
0 


where a = 8n5k*/15h3c?. It is inversely proportional to 
h? and as h— 0 (the limiting case of classical theory), 
a— m. 

Major contributions to the theory of equilibrium radia- 
tion were works by Wien of 1893-96 [78-80] devoted to find- 
ing the spectral distribution of equilibrium radiation, a 
problem posed by Kirchhoff. It was from these results that 
Planck and Einstein started off in their research in quantum 
theory. 

Wien [78] obtained a law that gave the general behavior 
of the temperature dependence of spectral distribution, 
which can be written as (see, e.g. [73]) 

v3 

u,e=F(7), (4) 
where F is an arbitrary function of v/T. This law is gener- 
ally referred to as Wien’s displacement law (see, e.g. [8] 
and [11]). To find u, r, a function of two variables, is thus 
to find F, a function of one variable v/T, and the Stefan- 
Boltzmann law follows as a consequence of (4), the value 
of a being dependent on the form of F, 
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In deriving (4) Wien, like Boltzmann, started from the 
second law of thermodynamics and the electromagnetic 
theory of light. He considered the adiabatic compression 
of equilibrium radiation in a closed volume with reflecting 
walls and took into account the change in the wavelength 
of the radiation when reflected from a moving wall (accord- 
ing to Doppler’s principle). It is to be noted that the law 
(4), just like (3), also holds in quantum theory (where 
Doppler’s principle holds as well). It satisfies the Planck 
formula (1). 

A more detailed treatment of the law (4) Wien gave in 
[79]. For equilibrium radiation in a volume with reflecting 
walls he handled explicitly the entropy and introduced 
the notion of entropy density for monochromatic radiation, 
which was later used by Hinstein (see below). 

Wien [80] also found the specific form of spectral distri- 
bution, and obtained a formula that in terms of frequency 
has the general form 


uy, = 9 (v) exp [—f (v)/T], (5) 


where ọ (v) and f(v) are some functions of frequencies. 
This formula meets the displacement law (4), if we put 
@ (v) = av/c3 and f (v) = fv, where a and f are constants. 
This leads us to Wien’s radiation law 


exp (—Bv/T), (6) 


which is a purely quantum limiting case of Planck’s ra- 
diation law (1) at large v/T (when hv/kT >>1, i.e. the 
energy quantum hv is much larger than the average thermal 
energy kT). Constants & and ß are proportional to h: 


a = 80h, B = hik. (7) 


In 1896 Wien arrived at the law (6) from the formal si- 
milarity between the spectral distribution of equilibrium 
radiation and the statistical law of the Maxwellian distri- 
bution over velocities v, which varies with v?/T as 
exp (—v?/T), where y is a constant. Wien supposed, draw- 
ing on the ideas contained in the 1887 work by V. A. Mikhel- 
son [81] (see about it in [82]), that gas molecules give off 
radiation with a wavelength A dependent on their velocity 
p (conversely, v is a function of A) and that the energy of the 


avs 
c3 





Uy, r= 
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radiation is proportional to the number of such molecules, 
and hence to exp (—yv?/T). For v? that varies inversely 
with A (i.e. proportional to v), as assumed by Wien, this 
immediately gives the explicit form of F (v/T) in (4): 
F (v/T) = a exp (—fv/T), where a and ß are constants, and 
further (6).6 It is easily seen that if —Bv/T = — hv/kT 
in (6) and —yv?/T = — mv?/2kT (m being the molecular 
mass) in the Maxwellian distribution this gives mv?/2 = 
hy, i.e. in modern thinking this corresponds to the pro- 
duction of a photon due to the kinetic energy of fast mole- 
cules (mv?/2 = hv >> kT). Of course, there is no direct trans- 
fer (as it was supposed by Mikhelson and later by Wien) of 
the energy of particles into the energy of radiation. Instead, 
in a binary collision, due to the relative kinetic energy of 
two molecules, one molecule may be excited from an energy 
level Æ, to E; and then emit a photon with the energy 
E; — E, = hy (conversely, a photon with energy hv may be 
absorbed by a molecule, and the excitation energy E; —E,, 
may then in a collision go over into the relative kinetic 
energy of the colliding molecules). 

We thus see that Wien came close to the idea of quanta 
of light, or photons, and it was for this reason that it yielded 
the purely quantum law (6) valid at large v/T. Hund wrote 
that ‘for the first time a thought occurred that radiation 
is associated with particles whose energy varies with v. 
These were not yet in full measure the future quanta of Ein- 
stein, but they came close to them’ [6]. 

Wien’s radiation law (6) was finding much experimental 
support at large v/T = c/AT, at which first measurements 
of emissive power of bodies were made. It became the start- 
ing point of fundamental investigations, at first by Planck, 
who discovered the quantum of action and energy quanti- 
zation, and then by Einstein, who discovered the quanti- 
zation of radiation energy. These discoveries gradually 
revealed the quantum nature of microscopic effects, quan- 
tum concepts were gaining much ground and the scene was 
set for Bohr to apply them, on the one hand, to the atom, 
and, on the other, to the ideas of particle-wave dualism. 


ë Mikhelson assumed v to vary inversely with A, which yielded 
an exponential function proportional to v?/7 that was al variance 
with the displacement law. 
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Quantum of action. Quantization of energy of matter and 
of radiation energy. Planck’s studies of equilibrium ra- 
diation can be divided into several stages. 

During the first one, from 1894 to early in 1900, Planck 
carried out a number of investigations into equilibrium of 
radiation and matter, proceeding, like Wien, from thermo- 
dynamics (a subject of his research earlier in his career) 
and classical clectrodynamics. He sought a rigorous sub- 
stantiation of Wien’s radiation law (6), which, although 
in agreement with experiment, was deduced by Wien with- 
out much rigour. The results of this phase of his work Planck 
summarized in his papers [83, 84]. He modelled matter in 
equilibrium with a ‘natural’ radiation (whose energy is 
‘distributed absolutely nonuniformly over individual 
partial oscillations’ [83] and to which the concept of 
entropy is applicable), as a multitude of ‘resonators’—har- 
monic oscillators—oscillating with various frequencies and 
exchanging their energy with the radiation field by reso- 
nance emission and absorption of electromagnetic waves 
at appropriate frequencies. It is to be stressed that for 
Planck resonators were a model required to obtain radia- 
tion laws, but he was not interested in the specific character- 
istics of the oscillators (‘the inner structure of the resona- 
tors’). He wrote: ‘It is insignificant whether the oscillations 
of the elementary resonators are based on conduction cur- 
rents (of course, in the absence of an electric resistance) 
or on convection currents (motion of electrically charged 
particles)’ [83]. For a multitude of resonators that inter- 
act with radiation Planck solved two problems: first, 
he found connection of u,,r with the average energy FE, 7 
of a resonator; second, he found this average energy at a 
given temperature 7, common for matter and radiation, 
i.e. at complete thermodynamic equilibrium.’ 

Handling the first problem, Planck, in accordance with 
classical electrodynamics, treated emission and absorption 
of electromagnetic energy as continuous processes. He 
obtained the formula 

nv? 
U,,T°® F Ey r= ZE, T? (8) 


3 





_ 7 Planck used the notation Z,,p = U, and Einstein later Zr = 
Ber, y7 7 
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where Z, = 8nv?/c®. It is of interest to note the simple 
physical meaning of the factor Z,: it is the number of natural 
oscillations of the equilibrium radiation itself (per unit 
volume per unit frequency). This was found later by Rayleigh 
[85, 86] and Jeans [87], who from the classical energy equi- 
partition law obtained the law 


uy, p= ZykT = E kT, (9) 





where kT is the average energy per natural oscillation. The 
Rayleigh-Jeans law (9) is a purely classical limiting case 
of Planck’s radiation law (1) for small kT (when hv = 
kT <1, i.e. when the energy quantum is far smaller than 
the average thermal energy) and it does not contain h. This 
law immediately follows from (8), if we put E,r = kT by 
applying the energy equipartition law to the resonators that 
modelled the matter. But Planck did not follow this path;® 
only in 1906 he considered the law (9) [73]. 

As for the second problem, Planck was successful in 
finding the average entropy S,,r of a resonator as a function 
of E,,r [83]: 


E 
Sy r= (nr -1), (10) 
where a and b are constants. If now we take into account 
the conditions under which the total entropy of resonators 
of various frequencies and ‘natural’ radiation is maximal, 
we will obtain the formula E, r = bv exp (—av/T). If 
then we substitute it into (8), we obtain Wien’s radiation 
law (6) with constants a = 8nb and ß = a. Expression (10) 
for S, 7 is a purely quantum one, just like Wien’s radia- 
tion law. Comparison with (7) shows that the constants in- 
troduced by Planck in (10) are a = h/k and b = h. Sig- 
nificantly, the expression now includes a constant having the 
dimensions of action—the quantum of action h. Planck 
himself stressed the universal nature of a and b and sug- 
gested to use them as the basis (together with c and g; cf 
[59]) of the system of ‘units of nature’. Using Wien’s ra- 
diation law he computed from experimental data the nu- 
merical values a = 0.4818 x 101° s-°C and b = 6.885 x 


8 Later this was made by Einstein [88]. 
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10-2” erg-s [83],° which only slightly differ from the 
modern values of h/k and h. To be sure, Planck, just as 
previously Wien for the law (6), had no idea that he was 
dealing with the purely quantum expression (10), in which 
E,,r/bv = E, lhv is the average number of energy quanta 
per resonator, and E, +/av = k (E,,r/hv) is the same num- 
ber multiplied by Boltzmann’s constant k, which is in agree- 
ment with quantization of the resonator’s energy, as in- 
one by Planck in his later works (beginning with 
72)). 

The second, transitional stage (1900), was an important, 
although a short, one. Sticking to his thermodynamic 
method Planck semiempirically obtained for a multitude 
of resonators and ‘natural’ electromagnetic radiation [89] 
his famous radiation law (1), which can be written as 


avs 1 
Uy, T= “3 “Sep VLT ? (11) 
where & and ß are the same constants as in Wien’s radiation 
law (6).1° Planck’s approach was based on the behavior 
of the second derivative of the average entropy of an os- 
cillator, Sy r, with respect to its average energy E,,r. Planck 
had shown previously (early in 1900 [84]) that this deriva- 
tive, which appears when dealing with small deviations of 
resonator’s energy from its stationary value (i.e. fluctua- 
tions of its energy), in the general case has the form 
dS, r/dE} r = — f (Ey,r), where f(E,,r) is an arbitrary 
positive function of E, r. In a special case where it has the 
form 


@S, rldE} p = — AIE, r, (12) 


where A is a positive coefficient, which can depend on v, 
it can yield Wien’s radiation law; A = 1/av, where a is the 
constant in (10) [84]. 

Planck thought that he succeeded in justifying (12), and 
hence in proving the validity of that law for all v/T. But 


® These results and the derivation of Wein’s radiation law were 
first reported by Planck at a meeting of the Prussian Academy of 
Sciences on 18 May 1899. 

10 In his work [89], first reported at a meeting of the German Phys- 
ics Society, Planck gave an equivalent formula with two constants 
for the spectral distribution over wavelength A = c/v. 
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when it was shown experimentally (specifically by Rubens 
and Kurlbaum) that Wien’s radiation law does not work 
at small v/T, Planck suggested, instead of (12), a more 
general formula 





Syt o 8 o ‘ 
dup ~~ By Pie + Ey a)? (18) 


which contains two coefficients!! s and e. It was from this 
formula that he obtained, using the thermodynamic rela- 
tion dS,,/dE,,r = 1/T and Wien’s displacement law, the 
law (11) with œ = 8ne/v and ß = e/sv, which agrees with 
experiment not only at large v/7, when it reduces to Wien’s 
radiation law, but also at small v/7, when u,,r varies with 
T (see (9)). 

Comparison with (7) shows that in (13) e = hv ands = k. 
The denominator of (13) thus includes hv (which made its 
explicit appearance only in the next work by Planck [72]) 
and in the limiting case € = hy < Fy 7 


aS, p/dEy, r= — k/ E}, T, (14) 


and this immediately gives the Rayleigh-Jeans law (9);!? 
whereas in the limiting case e = hv > Ey,r formula (13) 
reduces to (12) (where A = s/e = k/hv) and leads to Wien’s 
radiation law (6). Formula (13) can be regarded as an in- 
terpolation between the limiting cases (12) and (14).1? Its 
more profound meaning was only revealed much later, after 
Einstein had shown [92] that for equilibrium radiation the 
mean square of fluctuations of its energy, which varies in- 
versely with the second derivative of entropy with respect 
to energy, is the sum of the corpuscular and wave terms 
(see below). It turned out that in reality (13), and hence 
(11), reflects the particle-wave dualism for radiation (see 
[3, 8, 93]). 


11 They have the dimensions of entropy and energy. Planck himself 
denoted them as & and f and formula (11) was written by him in terms 
a Wav eleneihe with coefficients C and c [89] (in Planck’s notation [72] 

=c and c = cg). 

12 Integration of dS,r/dE,r = 1/T gives KE,r, whence 
Eur, = kT, and by (8) wp = (8nv2/c) KT. 

3 Thus Planck referred to it [90, 91] later. In [89] he had not yet 
a the limiting case of small v/T and about interpolation 
see ; 
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In the third, especially important, stage of his fundamen- 
tal research Planck published the works [72, 94, 95] during 
an exceedingly short period of time—from the end of 1900 
to the beginning of 1901—in which he for the first time in- 
troduced explicitly his quantum concepts. He sought the 
average energy of a resonator, E,,r, using, instead of thermo- 
dynamics, a statistical approach, which was based on Boltz- 
mann’s relationship between entropy and probability S = 
k In W + S, (see above).!* Planck also sought to give a 
more rigorous justification to his formula (11) and to unearth 
its ‘real physical significance’. And in this he succeeded 
‘after several weeks of the most strenuous work’, as he 
said later in his Nobel lecture [90]. He obtained it in 
the form (1), which contained explicitly the universal 
constants k and K = R/N, . In terms of k and v Planck de- 
termined the discrete energy element € = hv, which (denoted 
as ß) enters into his formula (13) for dS, 7/dE),¢ 
[89].#® 

Planck worked out the average entropy S,,r for a set 
N of resonators of frequency v as a function of their average 
energy E, 7, = En/N, where Ey is the total energy of all 
resonators. He started off with the assumption that this energy 
consists of ‘an integral number of finite equal parts’ 
[94]—of P elements of energy (i.e. Ey = Pe), and com- 
puted the probability W of the distribution of P elements 
over N resonators. Planck then made use of the relationship 
Sy =k ln W between the total entropy of all resonators 
Sy = NS, r and the probability W (setting So = 0) to 
obtain 


Syr=Sy/N =k| (4 + Fr) In (1 er) 


& 


E Ey, 
- mr]. (45) 


14 Planck then introduced a constant k equal to the ratio of the 
gas constant R and Avogadro’s number N ,, Later on it came to be 
known as;Boltzmann’s constant. 

15 His results Planck first reported at a meeting of the German 
Physics Society on 14 December 1900 [72]. This day is generally re- 
garded as a birthday of quantum theory (see [ 96]; see also [3]). 
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From this, using the thermodynamic relation dS, r/dE,,r = 
1/T and Wien’s displacement law!“ that holds at e= khv, 
he obtained the average energy of the resonator 
hv 
Ey, r = win’ (16) 
and, from (8), the law (1)."? 

From Planck’s hypothesis about discrete energy ele- 
ments hv distributed over resonators (harmonic oscilla- 
tors) followed the only conclusion possible: the energy of 
such an oscillator E, must be a multiple of hv: 


E, = ne = nhy, (17) 


where n is an integer (including zero). In modern ter- 
minology, formula (17) describes the quantization of energy 
of the oscillator, its energy levels. Since Planck’s resonators 
modelled matter, Planck’s hypothesis concerning the dis- 
crete nature of energy had to do with matter and meant 
quantizalion of energy of matter. 

It is quite significant that quantization of energy of the 
harmonic oscillator (17) suggested that its energy varies 
in a discrete manner, so that the energy can only change by 
e = hy or its multiple.’* Consequently, from quantization 
of energy of matter also followed the discrete nature of the 
elementary processes of interaction of radiation and matter, 
which is especially characteristic of the physics of the micro- 
world. 

It is worth noting that Planck was exceedingly cautious 
about any consequences involving the behavior of his 
resonators, which followed from his hypothesis of energy 
elements e, or energy quanta as they came to be known later. 


16 Planck showed [94] that this law can be represented as 
Sr = I (Ey,7/v), where f is an arbitrary function of one variable 
T! V. 

“i? Note that the expression (10) (with a = h/k and b= h) follows 
from (15) for £y,7/e = Ey,r/hv <A (i.e. when the average energy 
of the resonator is far smaller than hv) and yields, Ey, p = Av X 
exp (—hv/k7T) and Wien’s radiation law (6) with constants (7). 

18 Precisely this conclusion was made by Einstein in 1906 in [97]. 

19 It was not until much later that it was found that for radiation 
emission and absorption for the harmonic oscillator An = +1, and 
therefore when dealing with interaction of radiation with matter one 
should take into account energy changes only by =+ e (for dipole radia- 
tion). 


4—0107: 
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Initially Planck did not attach any fundamental importance 
to the multiplicity of the energy of an individual resonator 
to hv, he was interested not in the dynamic properties of 
this resonator, but in the statistical behavior of sets of 
resonators.”° 

An important stride in the evolution of quantum con- 
cepts was made by Einstein in 1905 in [88]. It was his first 
work on quantum theory.?! Unlike Planck, who proceeded 
from the assumption of the discrete nature of energy of mat- 
ter that interacts with radiation and that is modelled by 
resonators, Einstein assumed the discreteness of energy of 
radiation itself in the form of the hypothesis of quanta of 
light, or quanta of radiant energy (or photons in modern 
terminology). He went a different path from Planck’s pro- 
ceeding not from Planck’s results but rather directly from 
Wien’s works (see [82]). 

Einstein started with very clearly setting the wave theory 
of light, which assumes that the electromagnetic field is 
continuous and ‘beautifully holds when describing purely 
optical phenomena’, against the corpuscular theory of light, 
which assumes that ‘the energy of light propagates through 
space in a discrete manner’ and gives a better description 
of ‘phenomena that are associated with production and 
absorption of light’ [88]. Then Einstein assumed that an 
oscillating electron—‘resonator’— is described by E, p> = 
kT, according to the classical theory (‘kinetic theory of 
gases’). He goes on to show that Planck’s relation (8) leads 
to the law (9), and that this law disagrees with experiment, 
since | u,,rdv = oo. Thus, Einstein not only independently 

0 


20 Only in 1911 in his report to the First Solvay Congress Planck 
posed the question of ‘what dynamic law must be taken for oscillators’. 
He maintained that ‘it is the easiest to suppose that the oscillator’s 
energy must be an integral multiple of the energy element e = hv’ [98]. 
At the time Planck was much concerned by the disagreement 
with classical electrodynamics of the discrete nature of emission and 
absorption processes, which followed from quantum theory, and he 
made an attempt to settle this contradiction, but to no avail (for more 
petal see [3], where all the necessary bibliography is also avail- 
able). 

21 A detailed analysis of Einstein contribution to quantum theory 
in the years 1905-25 is given in [11]. 
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deduced the Rayleigh-Jeans law, but also clearly visualized 
the ‘ultraviolet catastrophe’. 

He obtained the same result in the limiting case of large 
T/v, or small v/T, from Planck’s formula [94] for u,,r, 
which agreed with all the previous experiments. He wrote 
it in the form (11) and in the later sections of [88] he did 
not use or mention it. He concluded that ‘our theoretical 
premises (the classical theory—M.E.) for short wavelengths 
and small radiation densities ... turn out to be absolutely 
unsuitable’. Later in the work he used Wien’s radiation 
law (6) as a law ‘supported experimentally very well’ for 
large v/T (i.e. he used the purely quantum limiting case 
of Planck’s law (1), see above). 

Planck expressed by (8) the spectral density of radiant 
energy u,,r through the average energy of a resonator 
E,,r, which characterized the properties of matter, and used 
Boltzmann’s relation to determine E,r for matter as a 
function of the average entropy of resonator S,,r. In con- 
trast, Einstein, knowing u,,r for radiation, sought the 
entropy of this radiation as a function of volume and used 
also Boltzmann’s relation, but now as applied to the radi- 
ation itself, to clarify the physical meaning of the resultant 
expression for the entropy. He came to the fundamental 
conclusion that at large v/T radiation possesses corpuscular 
properties. 

To begin with, Einstein found, following the ‘famous 
work by Wien’ [79], the total entropy of radiation in volume 


vasS=v | op, dv, where @, is the entropy density for 


0 
radiation of frequency v. He then found the maximal value 
of S and obtained for equilibrium radiation the relation 


Py, r/Ouy, r= 1/T, (18) 


which is valid for all frequencies. He wrote: ‘This is precisely 
the law of black body radiation. Consequently, from the 
function @,,r we can obtain the law of black body radi- 
ation, and conversely, integrating this law and considering 
that @,r = 0 at uy, = 0, we can obtain the function 
@v,r. Using this method Einstein found from the law (6) 
the entropy density Py,r = —(u,,r/ßv) [In (u,,re?/av?) — 
1] and for the entropy of radiation (with frequencies from 


4* 
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v to v + dv) S = vg,,r dv as a function of v he obtains the 
formula 
E v 

S— S= In (+). (19) 
where E = vu,,r dv is the energy of radiation with fre- 
quencies from v to v + dv in volume v, and S, is its entropy 
at volume vy. Einstein compared (19) with a similar formula 
for the entropy of the ideal gas or the diluted solution of 
n particles. This formula can be obtained as a special case 
of the Boltzmann relation S — So = (R/Na) In W for 
the ‘statistical probability’ W = (v/v,)" ‘that at a randomly 
taken time moment all n points that move about independent- 
ly of one another in a given volume v, will happen (by chance) 
to be in the volume v’. This comparison suggested to Ein- 
stein that for radiation one should put” 


gs eee 


(20) 


He concludes: ‘Monochromatic radiation of low density 
(within the region of validity of Wien’s radiation law) in 
terms of heat theory behaves as if it consisted of independent 
energy quanta RBv/N,’ (i.e. kßv = hv) [88]. Further 
he wrote: ‘But if monochromatic radiation (of sufficiently 
low density) in the sense of dependence of its entropy on 
volume behaves as a discrete medium consisting of energy 
quanta RBv/N,, then the question presents itself of whether 
the laws of production and transformation of light are such 
as if light consisted of similar energy quanta.’ And Einstein 
goes over from statistical treatment of the laws governing 
equilibrium radiation to treatment of elementary processes 
of interaction of radiation with matter—emission and absorp- 
tion of radiant energy in the form of quanta of light— 
photons.?? He explained the Stokes law as a result of emis- 
sion of a photon of frequency v,, which is smaller than or 
equal to the frequency v, of the absorbed photon (hv,< hv,). 
He also obtained for ‘excitation of cathode rays on exposure 


22 In [88] Einstein did not yet introduce Boltzmann's constant k 
and Planck’s constant k. These constants are k = R/N, and h = 
kB = RB/N, (see formula (7)), therefore Na E/RBv = E/kBv = Elhv. 
The last expression is the number of quanta of radiant energy. 

23 The term ‘photon’ was suggested by Lewis in 1926 [99] and stuck. 
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of solids’, i.e. for photoelectric effect, his famous equation, 
from which the maximum kinetic energy of an electron 
knocked out by a photon is eV = hv — P, where P is the 
work function, e is the electron charge, V is the retarding 
potential.” Then he dealt with the ionization of gases by 
ultraviolet light on the assumption that ‘each absorbed 
quantum causes the ionization of one molecule’ (this was 
the initial formulation of the law of photochemical equi- 
valent, whose connection with thermodynamics was the 
subject of later works by Einstein [100-102]). 

Einstein treated the elementary processes of interaction 
of radiation with matter basing himself on the concept of 
the discrete nature of radiant energy—the quantization of this 
energy and its immediate consequence that the very processes 
of emission and absorption are discrete. Planck obtained this 
conclusion in an indirect way as a consequence of the dis- 
creteness of the energy of matter (see above), which is 
not always the case in reality. It was Einstein’s approach 
that made it possible to reveal the discreteness of the energy 
of radiation itself, which manifested itself through its 
interaction with matter, and to relate the scale of the fre- 
quency v of electromagnetic waves to the scale of the photon 
energy hv which corresponds to particle-wave dualism for 
radiation (see below). 

The works by Planck [72, 94, 95] and Einstein [88] just 
considered paved the way for the development of quantum 
notions on the basis of the concepts of quantization of the 
energy of matter, of the discreteness of elementary process 
of interaction of radiation with matter, and of the quantiz- 
ation of the energy of radiation. The early quantum think- 
ing was dominated by the first two ideas, especially the 
quantization of the energy of matter. This set the scene 
for the quantum theory of Bohr’s atom, whereas the idea of 
the quantization of radiant energy suggested by Einstein 
in the form of the hypothesis of light quanta had taken its 
time gaining recognition till the 1920s as an idea beingin 
an especially sharp contrast with classical electrodynam- 


24 Even now the work [88] is often incorrectly referred to as ‘Ein- 
stein’s paper on photoelectric effect’, athough its key point is the 
hypothesis of light quanta. 
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ics.2> Note, however, that it was this idea that in later 
years (beginning in 1909) became the starting point when 
Einstein was dealing with the issue of particle-wave dualism 
of radiation (see Sec. 5). 

In our analysis of the growth of quantum thinking after 
1905 we will concentrate on further works by the fathers 
of the quantum ideas, Planck and Einstein. 

In 1906 Planck (after an interruption?) gave a summary 
of his results in the review monograph [73], now a classic. 
It contained a very important advance: for the quantization 
of the resonator—the harmonic oscillator—he obtained 
quantization as a result of breaking down phase space into 
identical elementary regions. He divided the plane q, p 
(q and p are the generalized coordinate and momentum of 
the oscillator) into elementary regions of area AE/v = h, 
each bounded by two ellipses corresponding to oscillator 
energies E and E + AE. Planck thus introduced energy 
increments AE = e = hy ‘which correspond to the same 
probabilities and which, therefore, should be called energy 
elements’. He underscored: ‘The fact that the constant h is 
introduced as a certain finite quantity is characteristic of 
the entire theory expounded here.’ It was this constant, 
which enters into the law (1), that Planck termed the ele- 
mentary quantum of action. He noted that it is ‘the size of 
an elementary region in the plane of states of the resonator 
that is valid for resonators with an arbitrary period of 
oscillations’ [73]. In his report [98] Planck presented 
his result in the form 


Ete 
n= | | dp dq. (21) 


E 


25 In 1913 Einstein was nominated to the Prussian Academy of 
Sciences by Planck, Nernst, Rubens and Warburg. After an exceed- 
ingly high appraisal of Einstein’s contribution the citation went on 
to say: ‘The fact that in his reasoning he ventures beyond the confines 
of his aim, for example in his hypothesis of light quanta, should not 
be held against him too much. Because without taking risks it is 
really impossible to accomplish novel things even in most exact 
natural science’ [93]. 

26 From 1902 to 1905 Planck did not publish any works on quantum 
theory and his ideas had not yet become widely known. 
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Later this result came to be represented as the quantum con- 
dition 


É pdq=nh, (22) 


where n is the quantum number. Also it was applied to 
other systems, besides the oscillator, and generalized to 
the case of many degrees of freedom. It is worth noting that 
Planck himself in 1906 was interested not in the quantization 
of the energy of an individual resonator, but in equal 
probabilities for energy elements’e proportional to areas in 
phase space (cf Footnote 20). 

Division of phase space into identica? cells of finite di- 
mensions (of volume A? for particles with three degrees 
of freedom) became, in combination with the principle of 
indistinguishability of particles, one of the major methods 
in quantum statistics. Planck’s book [73] gained wide 
recognition and contributed to the wide acceptance of the 
idea of quantization of the energy of matter.?’ 

Of especial significance for the further evolution of quan- 
tum theory were the 1906-7 works by Einstein [97, 104], 
which marked the second phase in his quantum studies. 
Just as in [88], they combined statistical approach with 
examination of elementary processes (which was also char- 
acteristic of his earlier works of 1901-2, [11]). But unlike 
[88], which was based on the limiting case of Wien’s radia- 
tion law (6) and not on the more general Planck’s radiation 
law (1), even the work [97], published in March 1906, posed 
and answered the question of the relation of Einstein’s 
approach (quantization of radiant energy) to Planck’s 
approach (quantization of the energy of matter).?® 

In the beginning of his work [97] Einstein mentioned 
his previous paper [88], in which he ‘came to the conclusion 
that light with a frequency v can be absorbed and emitted 
only in energy quanta (R/N,) Bv’. He went on to say: ‘It 
seemed to me then that Planck’s radiation theory [94] was 


27 Tt saw four editions, each revised appropriately (the fifth edition 
was in 1923 [103]). 

28 Notice that Einstein refers to Planck’s works [89, 94] but not 
to his book [73], which was issued only early in 1906. 
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in a way in contradiction to my work. But new arguments 
that are provided in Sec. 1 of this paper carry conviction 
that the theoretical basis of Planck’s reasoning differs from 
the one that might be obtained from Maxwell’s theory and 
the theory of electrons. 

‘Planck’s theory actually implicitly used the above 
mentioned hypothesis of light quanta.’ Later in the 
paper Einstein "showed that Planck’s assumption that 
the energy of resonators is discrete disagrees with 
Planck’s relation (8), from which must follow the Rayleigh- 
Jeans law (9), not Planck’s law. Einstein concluded ‘that 
at the heart of Planck’s theory lies the following statement. 
The energy of an elementary resonator can only assume 
integral values, multiples of (R/Na) Bv (i.e. hv—M.E.); 
upon absorption and emission the resonator’s energy changes 
abruptly, namely by a multiple of (R/N,) Bv’. Then he 
made another important conclusion: ‘If the resonator’s 
energy can only change abruptly, then to find the average 
energy of a resonator in a radiation field one should not 
use the conventional theory of electricity, because this 
theory knows no distinguished values of energy. Hence such 
an assumption is inherent in Planck’s theory.’ 

We see that Einstein brings out in Planck’s theory the 
relation between the discreteness of emission and absorption 
processes (which, according to Einstein, follows from the 
hypothesis of light quanta) and the assumption of distin- 
guished—discrete—values of the energy of resonators, i.e. 
of quantization of the energy of matter. Later on Einstein 
[104]? refined the idea of quantization of the energy of 
matter and found an exceptionally important application 
for it to the theory of heat capacities. 

"#At first Einstein provided a very simple and important 
‘derivation of a formula for the average energy of Planck’s 
resonator, a derivation that clearly reveals a connection 
with molecular mechanics’. He started with Boltzmann’s 
distribution, applying it for the first time to quantized 
states (which has become commonplace now), and found for 
a resonator with discrete energies (17) the average value of 


29 In this work he already refers to Planck’s monograph [73]. 
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this energy by a formula, which can be written in the form 


S} En exp (— En/kT) 


Ey, 177 
>) exp (—En/kT) 
n=0 


N) nhy exp (— nhv/kT) 


_. _n=0 = hv 
u exp (hv/kT)—1 * (23) 


5 exp(—nhv/kT) 

n=0 
He thus very simply obtained the result (16), which Planck 
produced in a much more involved manner.?® Einstein used 
then this result and the relationship (8) (which he retained 
assuming that ‘Maxwell’s theory of electricity gives a cor- 
rect picture of the relation between the radiation density 
and E,,r') to arrive at Planck’s law (1). 

Further Einstein made a very important general conclu- 
sion: ‘... up until the present time it was believed that 
molecular motion is governed by exactly the same laws as 
the motion of bodies in our everyday experience (with the 
addition of only the postulate of total reversibility), but 
now it must be assumed that for ions, oscillating with a 
certain frequency and involved in energy exchange between 
matter and radiation, the set of states that can be taken 
by these ions is smaller than for bodies in our everyday 
experience.’ Einstein goes further and poses the question: 
‘If we cannot understand the elementary formations whose 
existence was assumed in the theory of energy exchange 
between radiation and matter within the framework of the 
present-day molecular kinetic theory of heat, then should 
we perhaps modify the theory also for other periodically 
oscillating formations dealt with by the molecular theory 
of heat?’ His answer to the question was positive, and so 
he applied the law of quantization of the harmonic oscillator 
(17) to the specific (although rather crude) model of atoms 
that vibrate independently in a solid body and came to the 
quantum theory of heat capacities. For the average energy 


30 Note that in [98] Planck furnished Einstein’s derivation with 
reference to [104]. 
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of oscillations per atom (with three rotational degrees of 
freedom assigned to each atom) Einstein arrived at the triple 
quantity (16), and for the gram-atom at the energy 
3Nıhv/lexp (hv/kT) — 1], which when differentiated with 
respect to 7 gives the heat capacity (per gram-atom) 


oe exp (hv/kT) (hv/kT)2 
Co = 3R Tex kA (24) 


This heat capacity vanishes as T —0, which agrees with 
experiment qualitatively. 

This paper by Einstein solved in principle the difficulties 
of the classical theory of heat capacities (see above) and 
marked the beginning of the quantum theory of heat capac- 
ities, which have been developing successfully ever since. 
Large contributions to it were made by Nernst, Debye, 
Born and others. Einstein himself made a report at the First 
Solvay Congress in 1911 entitled ‘On the State-of-the-Art 
of the Issue of Specific Heat’ [105], in which he also con- 
sidered general issues of quantum theory. 

In the years 1908-13 those areas of quantum theory took 
shape which at a later date (in the mid-1920s and later) 
predetermined its establishment (see Introduction). These 
areas were associated, on the one hand, with the concepts 
of quantization of the energy of matter and discreteness of 
elementary processes of interaction of radiation with matter 
(this direction, as noted above, was the principal one), 
on the other hand, with the idea of quantization of radiant 
energy. The latter, although it did not have much following 
at the First Solvay Congress (whose general topic was “The 
Theory of Radiation and Quanta’), was still pursued by 
Einstein (although with interruptions because of his work 
on the general theory of relativity; see e.g. [4]). 

It is significant that along with Einstein’s ideas of the 
corpuscular structure of radiation the idea of quantization 
of radiant energy started to take another form. At first 
Ehrenfest in 1906 [106],* and then independently Debye 
in 1910 [107] applied the idea of quantization of an oscil- 
lator’s energy to the energy of natural oscillations of equi- 
librium radiation in a cavity on the basis of the results of 


31 This work was published by Ehrenfest in connection with 
Planck’s book [73]. 
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Rayleigh and Jeans. According to Ehrenfest and Debye 
the energy of each of such natural oscillations (the number 
of them is Z, = 8nv?/c?) must be quantized by the con- 
dition (17), just as for Planck’s resonator. Using this assump- 
tion, Debye applied a probabilistic method, analogous to 
that used by Planck in solving the problem of the distri- 
bution of total oscillatory energy Ey in the form of lumps 
of energy e over N resonators, to find the average energy 
of one natural oscillation of frequency v, which naturally 
appeared to be equal to the average energy (16) of one reso- 
nator. Multiplying this average energy by Z, Debye arrived 
at Planck’s law (1) and thus derived it from the properties 
of radiation itself, without recourse to any modelling of mat- 
ter by resonators. The fruitful idea of quantization of the 
energy of natural oscillations of a field was later used as the 
core of quantum electrodynamics and then of the unified 
quantum theory of fields (see, e.g. [108]). And it was only 
then that an interrelation between this idea and Einstein’s 
hypothesis of light quanta was revealed using the concepts 
of particle-wave dualism.?? 

We now consider two main paths taken by quantum theory, 
the first being connected with works by Bohr, and the second 
with works by Einstein and de Broglie. 


5. Further Evolution of Quantum Theory 


The evolution of various aspects of quantum concepts also 
determined the further, largely independent, development 
of quantum theory along two paths, which later resulted 
in two approaches in the development of quantum mechan- 
ics—one based on the correspondence principle (Bohr, Heisen- 
berg), and one based on the particle-wave dualism (Einstein, 
de Broglie, Schrödinger). The first path was associated with 
the key problem of explaining in quantum terms the prop- 
erties of the atom and atomic systems. The problem was 
especially clearly formulated by Bohr (see Sec. 3). His 
theory of the atom, which was essentially based on the cor- 
respondence principle, was the key element in solving the 


_ *® Note that previously this interrelation was considered in Ehren- 
fest’s work [109] and, in connection with the Bose-Einstein statistics, 
in Schrédinger’s work [110]. 
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problem. Bohr’s theory of the atom became an important 
milestone in the development of quantum theory. The 
second path was associated with Einstein’s quantum theory 
of radiation developed by him from the fundamental ideas 
of particle-wave dualism and further application of these 
ideas by de Broglie to microparticles. This approach had 
clearly demonstrated that the microworld has its charac- 
teristic features and that it was necessary to reject the 
graphic concepts of classical physics, according to which 
the corpuscular and wave properties are conflicting. 

In what follows we will sketch the most significant phases 
of the development of quantum theory along each of these 
two paths. 

Bohr’s theory of the atom and correspondence principle.! 
The first, and most important, stage of Bohr’s studies on the 
theory of the atom were the years 1913-16, beginning with 
his classical paper ‘On the Constitution of Atoms and Mole- 
cules’ [113], the first part of which (submitted for publication 
on 5 April 1913) contained the essentials of that theory. 
It should be emphasized from the outset that there are two 
aspects to Bohr’s theory [112]. On the one hand, it contains 
the fundamental assumptions about the existence of steady 
states and about radiative (optical) quantum transitions 
between them—the two Bohr postulates, which were then 
comprehensively tested experimentally and were proved 
in quantum mechanics and quantum electrodynamics. On 
the other hand, there is Bohr’s specific model of the atom, 
according to which motion in steady states is considered on 
the basis of classical mechanics with additional quantum 
conditions. A very important role, often overlooked, was 
played in Bohr’s development of the model theory of the 
atom by the correspondence principle. The idea of corre: 
spondence between quantum and classical theories was used 
by Bohr from the very beginning in the first part of [113]. 

The ideas on which Bohr drew in applying quantum con- 
cepts to the Rutherford model of the atom were those of 
quantization of the energy of matter and of the discrete 
nature of elementary processes of emission and absorption, 
which had been previously refined by a number of workers 
after Planck and Einstein. In particular, in 1910 Haas made 


1 See book [111], paper [9] and review [112]. 
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an attempt to determine, using Thomson’s model of the 
atom, the energy of the ground state of the hydrogen atom 
by equating the potential energy of the electron e?/a to 
the energy quantum hv, where v is the frequency at which 
an electron revolves in a circular orbit of a certain radius a 
(see [114], and also [3; 4, Chap. V]). In 1912 Nicholson 
sought to find a connection of patterns in atomic spectra with 
Planck’s constant, by quantizing the frequency of rotation 
of electron rings in atoms (see [3]). In his 1907-11 
works Stark, an adherent of the hypothesis of light quanta, 
used it to explain various elementary processes of interaction 
of radiation with matter. He considered the emission of 
light quanta when the energy of an electron is reduced as 
it is captured by an atom or as it moves in an extended 
elliptical orbit (see [4, Chap. IV]). 

In his postulates, which he employed to construct his 
theory of the atom and whose formulation he improved over 
the years’, Bohr combined the ideas of the quantization 
of the energy of matter and the discreteness of the elementary 
processes of emission and absorption of radiation with sub- 
stantially novel ideas. By the first postulate of Bohr’s, the 
atom is only stable for a certain set of states, ‘steady states’, 
which in the general case correspond to a discrete sequence 
of values of the energy of the atom. Each change in that 
energy is associated with a complete ‘transition’ of the atom 
from one steady state to another one. In this postulate the 
idea of quantization of the energy of the atomic system 
combines with the fundamental idea of stationarity of quan- 
tum states. By the second postulate, the atom’s ability to 
absorb and emit radiation is governed by a law by which 
the radiation associated with a transition must be mono- 
chromatic and it has frequency v, defined by the relation 
(frequency condition) 


hv = E; — Ep, (25) 


where h is Planck’s constant, E; and E, are the energies 
of two corresponding steady states. In this postulate the 
idea of the discrete nature of the elementary processes of 
emission and absorption is combined with the idea of the 


2 See [112]. Below these postulates are given as formulated by 
Bohr in his entry ‘Atom’ in the Encyclopaedia Britannica [115]. 
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frequency of a quantum transition distinct from the frequency 
of motion in the steady states between which the transition 
occurs. It should be stressed that before Bohr it was con- 
sidered obvious that the frequency of the emitted or absorbed 
radiation (energy quantum hv) must coincide with the fre- 
quency of orbiting in the atomic system. 

It is also of importance that one of the experimental cor- 
ner-stones of Bohr’s postulates was combination principle 
(see above).? The spectral terms 7,, 7,, Ty, ... (expressed 
in wave numbers) multiplied by —hc could immediately be 
interpreted as the energies of steady states, and the wave 
numbers of spectral lines oj, == Tp — T; multiplied by 
—he as differences of these energies, proportional to the 
frequencies of transitions v = vu = (E; — E,)/h = 
c (Th — T;) = coir. Bohr’s postulates, which mostly ex- 
plained spectra (throughout the entire range of frequencies, 
or energies of quanta hv), were thoroughly confirmed exper- 
imentally, specifically by the classical experiments of 
Franck and Hertz, and they appeared to be applicable both 
to atoms and to molecules (and later even for atomic nuclei). 
They were theoretically justified in quantum mechanics 
and quantum electrodynamics, and the name ‘postulates’ 
only retained its historical meaning. In modern quantum 
theory they represent its rather characteristic and important 
consequences.* 

Proceeding from his postulates, Bohr already in [113] 
achieved a spectacular success. For the hydrogen atom 
as the simplest system consisting of the nucleus with Z = 1 
and one electron, he estimates the Rydberg constant R in 
the relationship for the energies of steady states of this 


atom 
E, = —hcT,, = —hcRin?, (26) 


where n = 1, 2, 3, ... is an integer, which came later 
to be known as the principal quantum number, 7, is the 


3 Note that Bohr could eventually formulate his theory after he 
got acquainted with the spectral patterns (see, e.g. [4, Chap. VIII]). 

4 Note that textbooks and monographs so often present Bohr’s 
postulates incorrectly. For example, included among them is the con- 
dition for the quantization of the angular momentum for circular 
orbits in Bohr’s model (see below), and the entry ‘Atom’ in the last 
editions of the Encyclopaedia Britannica lists as many as four postula- 
tes (1!) 
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spectral term (see above). Having assumed from the very 
beginning that ‘the dynamic equilibrium of a system in 
steady states can be treated in terms of conventional me- 
chanics’ [113], i.e. classical mechanics, Bohr examined 
the motion of an electron of mass m in the hydrogen atom 
in an elliptical orbit under the action of the Coulomb attract- 
ive force exerted by the stationary nucleus (of mass M > m). 
He could obtain R (for the infinite nuclear mass, which is 
now denoted as Ræ) expressed through e, m, h 


R = 2n?me?lch?, (27) 


which beautifully agreed with experiment. Quite soon 
Bohr improved upon his remarkable result by taking into 
account the motion of the nucleus (in [117]; see also [116]). 
As a result, in (27) the electron mass m is replaced by the 
somewhat different reduced mass of the electron and the 
nucleus mM/(M + m) = m/(1 + m/M). This enabled Bohr 
theoretically to prove that the spectral series discovered by 
Fowler and assigned to hydrogen belongs in actuality to the 
Het ion.” 

To derive (27) required some additional assumption, which 
would make it possible to single out possible orbits corre- 
sponding to steady states. Bohr made a most convincing 
assumption [113] (he repeated it in [116]) that in the limiting 
case of low frequencies of transitions (obtained at large n) 
between steady states with energies En+ı and Æ, the fre- 
quency of this quanium transition v = (En+ı — En)/h = 
cR/(1/n? — 1/(n + 1)?] œ~ 2cR/n? must coincide ‘in ac- 
cordance with conventional electrodynamics’ [88] with 


5 For the hydrogen-like ion with nuclear charge Ze (Z > 2) for- 
mula (26) is replaced by E, = —hcRZ?/n2, specifically for Het E, = 
—4heR/n?. 

ê This formula was provided by Bohr in his reviewing report 
delivered on 20 December 1913 at the Physics Society in Copenhagen 
[116]. In [413] he used the constant X = cR, which has the dimensions 
of frequency. 

7 It iscommon knowledge that account of the motion of the nucleus 
played a decisive role in the discovery in 1932 of deuterium by spectral 
technique, and at present formula (27) with the reduced mass m = 
M,M,/(M, + M,) is widely uscd in the theory of ‘exotic’ atoms (e.g. 
positronium e*e~ and muonium pte-), which consist of two oppositely 
charged particles of masses M, and M, (such atoms are described by 
(26) with an appropriate R). 
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the classical frequency of orbiting w,.8 This immediately 
yields formula (27) for the Rydberg constant. Bohr’s as- 
sumption represented the original form of correspondence 
principle between quantum and classical theories, which 
was then developed by Bohr (see below) and at present 
should be viewed as one of the most important methodologic- 
al principles in physics [118, 119]. 

As a consequence of his model theory of the hydrogen 
atom—the consequence of quantization of the electron energy 
in this atom—Bohr immediately obtained for circular 
orbits? quantization condition for the angular momentum 


of the electron M, = mazo (p is the angular coordinate 


and ọ = dọ/dt is the angular velocity of the electron) in 
the form 


Mp = (h/2n) n. (28) 
This condition, which is a special case of the quantum con- 


dition of the type (22) at q = ọ and p = maig= M,, 
appeared to be useful for the further development of the 
model theory of the atom (both the hydrogen atom and 
complex atoms) by Bohr himself and especially by Sommer- 
feld, who formulated the generalized quantum conditions 
for a system of r degrees of freedom: 


G pidqi=n;h (i=1, 2, 3, ..., r), (29) 


where n; is the quantum number of the ith degree of free- 
dom, which takes on integral values. 

The conditions (29) are known to have been of use when 
applied at first to the electron in the hydrogen atom, and 
then to electrons in complex atoms. This allowed researchers 
to interpret a huge body of experimental evidence, notably 
the patterns in atomic spectra, including the splitting of 
spectral lines in external magnetic and electric fields (the 


8 He at first assumed (early in [113]) that the binding energy for 
an electron in a stationary orbit W, = En is nhw,/2, where o, = 
4n2me4/nh3 is the frequency of its revolution in such an orbit with 
a given value of n, which is calculated from classical mechanics. 

® Whose radii are ap = h?n?/An2me? = ayn?, where a, is the Bohr 
racs, which is now widely used in atomic physics as a natural unit 
length. 
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Zeeman and Stark effects, which were accounted for by 
the splitting of an atom’s energy levels in respective 
fields).?° 

It is to be emphasized that both the condition (28) and 
the more general conditions (29) (often termed the Bohr- 
Sommerfeld conditions) are additional conditions arti- 
ficially imposed in the model theory on the motion of elec- 
trons in steady states of the atom. The motion is considered 
according to classical mechanics, which is illegitimate for 
microscopic phenomena.’ Bohr himself understood per- 
fectly well the limitations of this approach, he believed 
it to be a forced measure. In [116] he wrote: ‘If we at all 
wish to gain a vivid idea of steady states, we have no other 
means, at least now, besides conventional mechanics.’ 
It was not until quantum mechanics was created that it 
turned out that the conditions (29) work out as the limiting 
case in the quasiclassical Wentzel-Kramers-Brillouin ap- 
proximation (see below) in accordance with correspondence 
principle. 

Bohr attached much importance to correspondence prin- 
ciple, he refined his ideas in the years 1918-20 [121, 122]. 
Bohr’s works in this period were the second stage of his 
research in quantum theory. Of significance in these works 
was the fact that instead of correlating classical motion in 
steady states to frequencies of such motion, each pair of 
states was associated with frequencies of the quantum tran- 
sition according to the frequency condition (25) (Bohr’s 
second postulate), the intensity of the radiant transition 
being determined by appropriate harmonic components 
in the expansion of the electric moment. Combining cor- 
respondence principle with quantization conditions, Bohr 
considered serial spectra of atoms of alkali metals charac- 
terizing, just like Sommerfeld, the steady states of the 


10 For more details about the quantum conditions in Bohr’s model 
theory of the atom see [3, Sec. 3.1] and [9]. The highlights of the theory 
were summarized by Sommerfeld in his classical monograph Atomic 
Structure and Spectral Lines, which saw four editions from 1919 to 
1924 (the last one was issued just before quantum mechanics was pro- 
duced [120)). 

11 That is why inclusion of the condition (28) into the number of 
Bohr’s postulates valid for the microworld is in principle unjustified 
(see Footnote 4 in this section). 
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external electron by the principal quantum number n 
and the azimuthal quantum number k, which determines 
the angular momentum by a law of the type (28): M, = 
hkl2s, where k takes on values 1, 2, 3, ..., n and the 
ratio k/n is the ratio of the semiminor to the semimajor 
axes of the elliptical orbit (at k = n, i.e. for circular orbits, 
formula (28) results).!? 

The third stage of Bohr’s atomic studies were his works 
in 1921-23, especially important of which was [63].1° In 
the latter Bohr gave a physical theory of the periodic system 
of the chemical elements based on the idea of filling up 
the electron shells (characterized by quantum numbers of 
individual electrons n and k) by successively binding the 
electrons by the nucleus. This manifests itself in optical 
spectra characteristic of each such attachment process 
[63]. Bohr was successful in revealing the patterns of 
filling the electron shells, specifically the order of binding 
the electrons with ever larger values of n and k to form groups 
of transition and rare earth elemcnts. At first in this 
paper, and then in more detail in the work [425] co-authored 
with Coster, Bohr analyzed characteristic X-ray spectra 
of atoms (whose origins were explored by Kossel in 1914-16 
[126, 127]). Bohr gave their thorough interpretation and 
identified ‘the role of X-ray spectra in accounting for the 
periodic system’. 

In determining the occupation numbers of electron shells 
with different pairs of values of n and k (nl-shells, to use 
modern termimology—1s, 2s, 2p, etc.) Bohr was guided by 
symmetry of the arrangement of electron orbits in the atom. 
He supposed that, given n, shells with different k (and hence 
with different l) are filled with the same number of electrons. 
Correct occupation numbers (2, 6, 10, 14, ...) were found 
in 1924 by Stoner [128], and in 1925 they were proved in 


12 It is common knowledge that in quantum mechanics azimuthal 
quantum number / = k — 1 assumes the values! = 0, 1, ..., n — 4, 
and Mp = (h/2n)? 1 (1 + 4). 

13 Together with (116, 122] it was included into the well-known 
collected works of Bohr [123] in an enlarged form (see also the remark- 
able Nobel lecture of Bohr [124]). 

14 A detailed examination of this classical paper, which is of great 
importance for chemistry, is given in [9]. 
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a classical work [129] by Pauli, who formulated his exclusion 
principle, which states that in an atom there may not be two 
or more electrons with the same values of the four quantum 
numbers required to characterize the state of an individual 
electron (in modern notation—n, l, j, mj or n, l, mj, mg). 
Three quantum numbers are needed to describe the orbital 
motion of an electron in an atom, and the fourth one is 
associated with the electron spin discovered by Goudsmit 
and Uhlenbeck late in 1925 [130, 131]. The Pauli exclusion 
principle appeared to be linked with the quantum statistics 
of particles with half-integral spin. Using this principle, 
Fermi in 1926 gave a treatment of quantization of ideal 
monoatomic gas and worked out a statistics, which came 
later to be known as the Fermi-Dirac statistics (see below)— 
a quantum statistics obeyed by electrons. 

Fermi’s discovery was a significant advance in Bohr’s 
theory of the atom. 

Taking stock of the fundamental results obtained by 
Bohr in respect of the quantum theory of the atom in 1913-23, 
we should appreciate the difficulty of the decisive step made 
by Bohr in 1913, when he audaciously rejected the concepts 
of classical physics. This is evidenced by comments of 
Rutherford and Einstein on Bohr’s theory. In 1936 Ruther- 
ford characterized the initial quantum theory of spectra put 
forward by Bohr as being one of the most revolutionary ones 
ever created in science. He went on to say that he knew of 
no other theory that would be more successful [132]. 
Einstein is known to have said concerning the implications 
of Planck’s law that all his own attempts to fit the theoretical 
foundations of physics to these results were complete failure. 
He felt as if ground was cut from his feet. He always mar- 
velled at Bohr who could work out the principal laws of 
spectral lines and electron shells of the atom on that shaky 
and contradictory ground. He qualified it as the highest 
musicality in thought [133]. 

Another achievement worthy of special mention is the 
development by Bohr of correspondence principle through- 
out the period from 1913 to 1925, a period just before the 
creation of quantum mechanics (see, in particular, Bohr’s 
works [134, 135]). It was from correspondence principle 
that Heisenberg produced the matrix formalism of quantum 
mechanics (see below). 
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Particle-wave dualism.© As early as 1905 Einstein [88] 
reflected on the wave and particle aspects of light, listing 
their applications—‘purely optical phenomena (diffraction, 
reflection, refraction, dispersion, etc.)’ and ‘phenomena of 
production and conversion of light’. This was already the 
beginnings of the approach to particle-wave dualism for 
radiation. But in [88] Einstein, starting off with Wien’s 
radiation law (6), only considered purely corpuscular prop- 
erties of radiation viewed as a set of independent quanta 
of light. In 1906-7 [97, 104], although he touched upon 
Planck’s radiation law (1), which implicitly contains par- 
ticle-wave dualism for radiation (see above), Einstein con- 
centrated on the issues of the quantization of the energy of 
matter and the theory of heat capacities. And it was only 
in 1909, in the third stage of his studies on quantum theory, 
that Einstein showed in his remarkable works [92, 136]2¢, 
based on an analysis of fluctuations of equilibrium radiation, 
that radiation possesses at the same time corpuscular and 
wave properties. He thus discovered particle-wave dualism 
for radiation (see [137]). This step was very important for 
the development of the entire quantum theory, but the 
significance of this discovery only became apparent later, 
when an understanding was achieved of the fundamental 
nature of the connection of wave and corpuscular properties 
and of the impossibility to reduce one set of properties to 
the other. 

Referring to his papers [97, 104], which contained ‘a mo- 
dification of the fundamentals of Planck’s theory’ (the 
relation (8) for average values holds in combination with 
the hypothesis of distinguished values of energy), Einstein 
posed the problem of ‘seeking the simplest possible inde- 


15 A more detailed treatment of the contribution of Einstein in 
1909-25 to the ideas of particle-wave dualism is given in [11, Secs. 5-7]. 

16 Work [136] is a reviewing report delivered on 21 September 
1909 at the 81th meeting of the Society of German Natural Scientists 
at Salzburg, where Einstein made his first appearance before the 
scientific community, which included leading theoretical physicists 
(among them Planck. Sommerfeld, Laue, Born) and experimentalists 
(among them Kaiser, Rubens, Stark, J. Franck) (see [4]). Speak- 
ing, in particular, about the ea of relativity he concluded that 
this theory ‘changes our views of the nature of light in that respect 
that light appears in it not in connection with a hypothetical medium, 
but as some independent entity, just like matter’ [136]. 
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pendent interpretations of Planck’s radiation formula’ 
and found ‘two approaches to this issue, which were re- 
markable for their simplicity’ [92], namely the exam- 
ination for equilibrium radiation of energy fluctuations 
and momentum fluctuations. 

For the fluctuations e = n — no of the energy of radi- 
ation ņ contained in volume v about its average value n, = 
E = vu,,rdv (for the frequency interval from v to v + dv; 
cf (19)) Einstein arrived at a formula, which can be written as 

= 25 \ı-1 
= —k (Fr) : (30) 
where S = vp,,rdv is the average entropy of the radi- 
ation. It can be found knowing u,,7.!’ Einstein refers to 
the formula relating @,,r and u,,r given by Planck [73]. 
‘After a simple calculation’ Einstein arrived at!® 
= c3 E2 
Shy E+ 5an y (31) 


v 


which can also be written as 
e= (hvus, r+ gr us) v dv. (32) 


The first term in (31) ‘gives energy fluctuations as if radi- 
ation consisted of point quanta of energy hv moving inde- 
pendently’. This is a corpuscular term, and if we put Elw = 
n, e/hv = An, where n is the average number of photons, 


and An = n —n is the change in this number, then (An)? = 


17 Note that 
dS 49 eur 
dE? vdv dui p ' 


= Bu T | 
2= — — wT 
e [*( du? y ) vdy, 





hence 


where the expression in brackets is independent of v and dv. 

18 This result is readily obtained if one takes into account that 
dS/dE = dp,,r/du,,r = 1/T (see (18)), hence d?S/dE? = (—1/T?) X 
aT/dE and for energy fluctuations one obtains an important for- 
mula, which was derived by Einstein back in 1904 [138] (see also 
[11)): e? = kT? (dE/dT). Differentiating E with respect to 7, using 
the law (1), gives (31) and (32). 
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(e/hv)? = Elhv = n, i.e. we get a conventional formula for 
the fluctuations of the number of particles. The second term 
in (31) describes the fluctuations according to wave theory 
as resulting from interference, which Einstein [92] worked 
out from an analysis of dimensions, and then Lorentz de- 
rived in a rigorous way [139]. The additivity of fluctuations 
is quite important. In the limiting case of large frequencies 
v and low densities of radiant energy u,,r there remains 
only the first, corpuscular, term, and in the opposite limit- 
ing case of small v and large u,,r, only the second, wave, 
term. In the general case corpuscular and wave fluctuations 
are additive. Radiation thus behaves at the same time in 
a corpuscular and a wave manner, which is characteristic 
of particle-wave dualism. 

Note that if instead of absolute values we introduce rela- 
tive values, then (31) assumes a very simple form and its 
physical meaning is revealed in a graphic fashion. By divid- 
ing £? = (qn — E)? by E? and considering that E = n-hv 


and n n'hy, we get 
e _ (n—n} (An? 1 1 
E? n? mn q’ (20) 


where q = v (8nv?/c?) dv = vZ, dv is the number of natural 
oscillations of the radiation field.?° 

Einstein looked for the fluctuations of radiation momentum 
A by considering the fluctuations during the time t of the 
pressure exerted by the radiation on a freely suspended 
mirror of area f on the assumption that it ideally reflects 
radiation within the frequency range from v to v + dv 
and is transparent to all other frequencies. Einstein used 


19 It was because of this additivity of terms proportional to uy,r 
and uy,T in (32) for the r.m.s. fluctuation e?, and hence for 1/(d?S/dE?) 
in (30), that Planck’s interpolation formula (13) is correct. In it 
E,,r is proportional to u,,r and can be viewed as the average energy 
of one natural oscillation of the radiation field, and ®S,r/dEN,T 
is proportional to d2S/dE? (see [93]).: 

20 A formula of this type was applied by Einstein in 1911 [105], 
to fluctuations of oscillatory energy in a solid. He interpreted n 
(in his notation Zg) as ‘the average number of Planck’s “quanta” 
available in the body’, i.e. phonons, in modern terminology [11]. 
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the law (1) to arrive at 


2 
at (hvus, rtg ui, r) fdv. (34) 
This relation is absolutely analogous to (32), and so he 
wrote: ‘In that case the formula tells us that, by Planck’s 
formula, both causes of fluctuations appear mutually inde- 
pendent (the additivity of terms that make up the square of 
fluctuations)’ [92]. 

Einstein attached much importance to the approach based 
on the consideration of the fluctuations of radiation mo- 
mentum. In his report [136] he chose it by stressing that 
this approach ‘owing to its graphic nature seems ... to 
be especially convincing’, and on the basis of (34) he made 
his conclusion about the existence of inhomogeneities in 
the distribution of radiation momentum, such that they 
‘for low density of radiant energy are much larger than those 
which follow from wave theory’. Einstein concludes his 
report by saying that ‘one should not view as incompatible 
both structures (wave and quantum), which must be in- 
Mae in radiation in accordance with Planck’s formula’ 
[136]. 

Einstein thus made an important contribution to quantum 
theory by providing support, on the basis of energy and 
momentum fluctuations, to the idea of particle-wave dualism 
for radiation, i.e. for the presence of two structures, wave 
and corpuscular, in any radiation. Hund correctly indicated: 
‘In his study of statistical fluctuations in radiation Einstein 
introduced particle-wave dualism for light’ [6]. 

Significantly, back in 1909 Einstein came to the con- 
clusion: ‘An elementary radiation process is, obviously, 
a directed one’ [136]. This led him later to the important 
supposition that the photon possesses a momentum hv/c 
in addition to energy hv. Already in (34) the radiant energy 
density u,,r is multiplied in the first term by hv/c, just like 
in (32) it is multiplied by kv. But explicitly the photon 
momentum was only introduced by Einstein in 1916 [140]. 
In the latter paper Einstein characteristically combined 
a statistical approach with an examination of elementary 
processes. This fundamental work, which marked the fourth 
phase in Hinstein’s research on quantum theory, is widely 
known for the fact that in it Einstein considered the exchange 
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of energy for radiation in equilibrium with gas molecules 
and introduced the coefficients Ain, Bris and Bip (Ein- 
stein’s coefficients, which characterize the probabilities of 
spontaneous and induced quantum transitions?! between 
two energy levels E, and E,) and derived, using the detailed 
balancing principle and Wien’s displacement law (4), 
Planck’s formula (1) (the derivation of Planck’s formula 
according to Einstein) and Bohr’s frequency condition (25) 
(‘the second principal rule in Bohr’s theory of spectra’).” 

It is worth noting that Einstein regarded his paper [140] 
as an application of ‘the original Wien’s consideration’ 
(applied by Wien to derive the radiation law (6) in [80]). 
He wrote: ‘Proceeding from certain hypotheses concerned 
with emission and absorption of radiation by molecules, 
which could be understood from the viewpoint of quantum 
theory, I showed that in temperature equilibrium mole- 
cules with states distributed according to quantum theory 
are in dynamic equilibrium with Planck’s radiation; in 
such a way Planck’s formula (11) is obtained in an amazingly 
simple and general manner’ [140]. At the same time 
Einstein dealt at length with ‘the transfer of momentum 
to molecules’ in absorption and emission. He then consid- 
ered molecular motion in the radiation field with due 
account of momentum fluctuations and Planck’s law (1) to 
conclude ‘that we will come to an unequivocal result only if 
all the elementary processes will be viewed as completely di- 
rected’ [140]. 

At the end of that work Einstein stated as a proved fact 
that if a molecule in an elementary induced process receives 
or imparts energy hv in the form of radiation, it always 
also receives the momentum hv/c: in absorption—in the 
direction of the incident beam of rays, and in induced emis- 
sion—in the opposite direction. Further Einstein stressed 
that for spontaneous emission ‘the process is directed as 
well. There is no spontaneous emission in the form of spheric- 
al waves. In an elementary process of spontaneous emission 
a molecule receives a recoil momentum, whose value is 
equal to hv/c, and the direction is determined, according 


21 Jt was very important to take into consideration induced emis- 
sion in addition to spontaneous emission and absorption. 
22 These results had already been published by Einstein in [144]. 
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to the present status of the theory, only by “chance”’. And 
he pointed out that ‘these properties of an elementary pro- 
cess ... make the creation of a true quantum theory of 
radiation inevitable. The theory has the weakness that, for 
one thing, it does not lead us to a more close contact with 
wave theory, for the other, the time and direction of an 
elementary process are governed by “chance”; although 
I am quite confident in the reliability of the method 
chosen’ [140]. Einstein thus came to a probabilistic treat- 
ment of the connection between corpuscular and wave 
properties of radiation—to a probabilistic interpretation of 
particle-wave dualism. Later on Born, giving his probabilistic 
interpretation of quantum mechanics, developed further 
Einstein’s ideas (see below). 

An important thing about the paper [140] was that Ein- 
stein, using detailed balancing principle to treat the inter- 
action of radiation with matter, took into consideration 
for elementary processes the discreteness of exchange both 
of energy and momentum of radiation according to the main 
relations for radiation 


E =h, p= hyle = hh, "(35) 


where £ is the energy transferred, and p is the magnitude 
of the momentum p transferred. He emphasized also the 
directivity of this process. It is to be noted that in 1905 
[88], in dealing with the interaction of radiation with matter, 
Einstein only considered the discrete nature of the energy 
transferred, not the momentum, and he did not yet pose the 
question of whether or not the elementary processes of radi- 
ation are directed (this question Einstein first mentioned 
in 1909 [136] and solved it in [140]). 

Relations (35) express particle-wave dualism for radiation, 
connecting the photon’s energy and momentum with the 
wave characteristics v and c/v = à of that radiation as electro- 
magnetic waves. In terms of the wave vector k they can be 


represented as 
E = hy, p=hk, (36) 
where k has the magnitude k = v/e = 1/4. 
23 Now the magnitude of k is assumed to be 27/À, and the relations 


(36) are written as E = hw and p = &k, where h = h/2n, and o = 
2nv. Then the wave number o = v/e = kl2n. 
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Note that in four-dimensional formalism of the theory of 
relativity the relations (36) can be written as 


P = ñK, (37) 


where P is the four-dimensional momentum with the compo- 
nents P, = px, Pa = py, Ps = pz, P, = iElc, and K is a 
four-dimensional wave vector with the components K, = kx, 
K, = k}, K; = k,, K, = ik = iv/e. It was for relativistic 
treatment that de Broglie later generalized particle-wave 
dualism to include the case of matter, i.e. microparticles 
with nonzero rest mass (see below). 

The notion of radiation as a multitude of particles of 
a special sort—radiation quanta—with energy hv and 
momentum hv/c = h/A turned out to be quite useful in the 
explanation in autumn of 1922 of Compton’s discovery— 
the change of the wavelength of X-rays scattered by elec- 
trons (Compton’s effect) [142]. The well-known elementary 
theory, worked out by Compton himself [143] and inde- 
pendently by Debye [144], was used to explain Compton’s 
effect by a collision of a photon with a free electron that 
obeys the energy and momentum conservation laws. The 
theory agreed with experiment and was later confirmed as 
applied to individual elementary processes of scattering,” 
which suggested that radiation does possess, in addition 
to wave properties, corpuscular properties in accordance 
with particle-wave dualism. 

Einstein wrote in 1924 [148] about Compton’s experi- 
ments: ‘The positive result of Compton’s experiment indi- 
cates that radiation behaves as if it consisted of discrete 
corpuscles not only in the sense of energy transfer, but 
also in the sense of momentum transfer.’ And after the 
applicability of the theory of Compton’s effect to individual 
elementary scattering processes had been confirmed, Bohr, 


24 Bohr, Kramers and Slater made a suggestion in 1924 in their 
work [145], which made quite a stir, that Compton’s effect only obeys 
the energy and momentum conservation laws statistically, but it was 
disproved in 1925 by experiments carried out by Bothe and Geiger 
[146] and Compton and Simon [147], which unequivocally suggested 
the fulfillment of the energy and momentum conservation laws for 
each elementary process of photon scattering by individual electrons. 
It is worth noting at the same time that the paper [145] contained also 
considerations important for the further development of quantum 
theory, specifically the one about the virtual radiation field (see, e.g, 


’ 
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a hitherto opponent of the theory of light quanta (as con- 
flicting with classical electrodynamics), made in July 
of 1925 his important comment [151]: ‘... the question of 
whether or not there exists a connection between individual 
atomic processes should not be treated as simply a difference 
between two clearly distinct interpretations of propagation 
of light in empty space, which would correspond to the 
corpuscular or wave theories of light’. And further: ‘... one 
should be prepared for the prospect that the desired general- 
ization of classical electrodynamics would require a decisive 
rejection of concepts on which the description of nature has 
rested so far’ [151]. This is what really occurred in 
the process of creation of quantum mechanics and quantum 
electrodynamics. 

Se far we concerned ourselves with the evolution of the 
ideas of particle-wave dualism as applied to radiation. Of 
exceptional importance was the application of these ideas 
to matter—to microparticles with nonzero rest mass. This 
was realized by de Broglie in his remarkable papers of 
1923-24," 

De Broglie published three small, although very impor- 
tant, papers in Reports of the Paris Academy of Sciences 
[153-155] (they were reported on 10 September, 24 September 
and 8 October 1923), and then a summary [156]. He gave 
a detailed account of his results in his doctor’s thesis [157] 
(defended on 29 October 1924). He suggested the funda- 
mentally new idea of describing the motion of microparticles 
by waves, called de Broglie’s waves. Already in his first paper 
[153] de Broglie gave a relativistic treatment of the motion 
with velocity v = Be of a particle of rest mass m, # 0,78 


25 Just like Planck and other scientists (see above). It should be 
noted, however, that Bohr’s response to the idea of the quantization 
of the natural oscillations of electromagnetic field according to Debye 
(see above) was positive. In 1921 [149] he compared the derivations of 
Planck’s radiation law (N) given by Einstein [140] and Debye [107] 
(see also comments [150]). 

26 He referred to the results of the Bothe and Geiger experiments 
and to the work he co-authored with Kramers and Slater [145]. 

: es; prehistory of de Broglie’s studies is given in [3] and also 
in ; 

28 De Broglie at first attempted to think of the quantum of light 
as such a particle (the paper [156] was entitled ‘A Tentative Theory of 
Light Quanta’), which was wrong. But his main idea of waves assigned 
to moving particles turned out to be unusually fruitful. 
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and ascribed to that particle an ‘intrinsic periodic phenom- 
enon’ with frequency v, according to the quantum relation 
for energy hvo = m,c*. Comparing the frequency of such 
a process v, = vy V1 — ß? for a stationary observer with 
the frequency obtained from the relation hv = m,c?/V 1—f?, 
which is equal to v = v,/V1 — ß?, de Broglie came to the 
conclusion that a particle is accompanied by a phase wave 
that does not carry energy and propagates with the velocity 
V = c/B = c*/v >c. He wrote that ‘if the intrinsic phenom- 
enon in a moving body coincides at the initial moment of 
time with the wave, this phase correspondence will also 
persist later’ [156]. 

Further de Broglie proved (for the first time in [154]) 
that the velocity of the particle, v, which enters into the 
expression for the phase velocity V, equals ‘the velocity of 
propagation of the energy of a group of waves’ [156], i.e., 
to the group velocity of phase waves (equal to dv/dk, 
whereas V = v/k = vA). Note, however, that any attempts 
to identify a particle with a group of waves—with a wave 
packet—made by de Broglie later appeared to be unsuccess- 
ful, since the wave packet smeared out with time and only 
a probabilistic interpretation of de Broglie’s waves made 
it possible correctly to understand the connection between 
wave and corpuscular properties, just as for radiation (cf 
above). 

For a phase wave the wave number k = 1/4 = v/V, from 
hv= m,c?/V1 — Band V = c2/v, is myv/h Vi — B® = phh, 
i.e. p = hk = h/d, where p is the momentum of the par- 
ticle. This is a fundamental relation of particle-wave dua- 
lism for microparticles, similar to the second relation of (35) 
for radiation. It was introduced explicitly by de Broglie 
in his thesis [457]. In the chapter ‘The Maupertuis principle 
and the Fermat principle’ de Broglie provides a general 
relativistic formulation of the relationship between the four- 
dimensional wave vector and the four-dimensional’ momen- 
tum (see (37))—of the ‘vector of world wave’ with the ‘famous 
vector ofthe “world”momentum”, which unites energy and 
momentum’. De Broglie formulated the following point: 
‘The Fermat principle, if applied to the phase wave, is identical 
to the Maupertuis principle, if applied to a moving” body. The 
possible dynamic trajectories of the moving body are identical 
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with the possible rays of the wave.’ Ile went on to say: ‘It 
seems that such an idea of profound connection between 
the two principles of geometrical optics and dynamics is 
of immense importance for finding the path to the synthesis 
of waves and quanta.’ De Broglie also understood the con- 
crete consequences of the presence of wave properties in 
microparticles, and already in [154] he mentioned the pos- 
sibility of finding the diffraction of electrons. It is well 
known that these possibilities were only realized in the 
years 1927-28. 

Of fundamental importance was the interpretation of the 
quantum conditions in Bohr’s model theory of the atom. 
For the case of a phase wave travelling in a closed trajectory 
of an electron with the velocity V = vA in the atom de 
Broglie formulated his stability condition: ‘Motion will 
only be stable if the length of the phase wave is compatible 
with the trajectory length’ [156]. For a circular orbit 
of radius a this immediately gives nA = 2na, where 
n is an integer, and leads to the quantum condition (28).?? 
De Broglie also obtained more general conditions for quant- 
ization (29) for the three-dimensional motion of the electron 
(‘the Sommerfeld conditions for quasiperiodic motion’). 
For the first time the quantum conditions were found in a natural 
way as a manifestation of the wave nature of microparticles. 

De Broglie’s ideas were received sceptically by most of 
the scientific community as conflicting with the common 
notion of purely corpuscular nature of matter. Einstein, 
however, paid attention to de Broglie’s results and appreci- 
ated their fundamental significance. At the time Einstein 
was engaged in his research on quantum statistics. And 
in the summer of 1924 the Indian physicist Bose sent to 
Einstein his work [158], which contained a derivation of 
Planck’s formula (1) on the basis of statistical computation 
of the number of possible distributions of photons over cells 
of phase space. Einstein translated Bose’s article from 
English into German and submitted it for publication with 
the following note: ‘The derivation of Planck’s formula 
suggested by Bose is in my opinion a great achievement. 


2® In fact, for à = V/v, where V = c2/v and v = myc*/h VI — PR, 
we get à = h V1 — B2/mgv = hlp, hence nh/p = 2na and Mp = 
ap = nh/2n. 
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The technique he used also gives a quantum theory of ideal 
gas, which I will lay down elsewhere’ [159]. He did it in 
his papers [160, 161], which constituted the fifth phase of his 
quantum theory studies. Bose’s and Einstein’s results thus 
combined to give rise to a quantum statistics—the Bose- 
Einstein statistics—a result of the evolution of the quantum 
theory of radiation and of the idea of particle-wave dualism. 

In [160] Einstein wrote: “There is no yet quantum theory 
of the monoatomic ideal gas not based on arbitrary specula- 
tions. This gap is filled in below using a novel technique, 
suggested by Bose and used by him for an exceptionally 
interesting derivation of Planck’s radiation formula.’ Ein- 
stein then gave a clear and concise characteristic of that 
technique: ‘The phase space of a certain elementary object 
(in this case, of a monoatomic molecule) divided by a given 
(three-dimensional) volume, is made up of “cells” of vol- 
ume Ah?. If there is a large number of elementary formations, 
their microscopic distribution, considered in thermody- 
namics, is characterized by the way in which the elementary 
formations are arranged over these cells. The “probability” 
of a certain macroscopically determined state (in Planck’s 
sense) equals the number of the various microstates that 
may realize the given macrostate. The entropy of the macro- 
state, as well as statistical and thermodynamic properties, 
of the system are then found using Boltzmann’s formula.’ 

It is quite significant that for the ideal monoatomic gas 
that obeys the Bose-Einstein statistics the relative fluctu- 
ations of the number of molecules are given by a formula that 
is absolutely analogous to (33) for the relative fluctuations 
of radiant energy (and the number of photons). Einstein 
also wrote down this formula in the form [161] 


(A,/n,)? = 1/n, + 1/z,, (38) 


where n, =n is the average number of gas molecules, 
‘belonging to the energy region AZ’, in a volume V, which 
‘communicates with an infinitely large volume of the same 
gas’, A = n — n = n — n, are the fluctuations of the 
number of molecules n, and z, is the number of cells of the 
phase space corresponding to AZ. Einstein pointed out that 
the second term in the case of radiation is determined by 
‘interference fluctuations’ and that this term ‘can be impart- 
ed an appropriate meaning in a gas as well, by putting into 
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correspondence to the gas a certain emission process and 
computing for the latter its interference fluctuations’. Ein- 
stein supposed ‘that this is not the case of a simple analogy’, 
he referred to de Broglie’s thesis [157] as a work ‘worthy of 
any attention’ (indicating in a reference that it ‘contains 
a very interesting geometrical interpretation of Bohr- 
Sommerfeld quantum rules’ (see above). He added that de 
Broglie showed ‘the way in which a material particle or 
a system of material particles can be correlated to a (scalar) 
wave field’. Einstein found ‘by direct calculation that 
1/z, is the mean square of the fluctuations of this wave 
field, which corresponds to the energy region AE under 
consideration’. 

We see thus that Einstein not only understood the im- 
mense importance of de Broglie’s ideas (which were a gener- 
alization to microparticles of Einstein’s ideas about par- 
ticle-wave dualism), but he succeeded in finding an appli- 
cation for them in quantum statistics. It is to be stressed 
that for the fluctuations of equilibrium radiation Einstein 
distinguished the first, corpuscular, term, whereas for the 
fluctuations of the ideal monoatomic gas he distinguished 
the second, wave, term. 

Einstein’s support of de Broglie’s ideas was instrumental 
in their dissemination. In 1925, after he had read Einstein’s 
paper [161], Schrödinger paid his attention to de Broglie’s 
results [162, 163]. He developed de Broglie’s ideas and thus 
became one of the initiators of quantum mechanics.*° 


6. Conclusion 


We have considered the origins of quantum theory by trac- 
ing their evolution throughout the 19th century, and final 
emergence of quantum theory at the turn of this century, its 
tantalizing development during the first quarter of the 
20th century. This development resulted in the formation 
of quantum theory in the middle and end of the 1920s, 
a period about which Bohr later wrote: “Those years, when 


% Note that even before his classical works on the wave form of 
quantum mechanics Schrödinger in his remarkable paper [110] em- 
phasized the importance of these ideas. 
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an unparalleled pooling of efforts of an entire generation 
of theoretical physicists from many countries step by step 
created a logically consistent generalization of classical 
mechanics and electrodynamics, were sometimes charac- 
terized as the “heroic” era of quantum physics. Each person, 
who followed those developments, will remember as an 
unforgettable experience the picture of how, as a result of 
a combination of the various approaches and the use of 
appropriate mathematical techniques, emerged a new 
view of the understanding of physical experiment’ [67]. 
Quantum theory grew up swiftly. The creation of nonrela- 
tivistic quantum mechanics as a result of the discovery of 
its mathematical tools and the physical interpretation of 
them, the development of quantum statistics, the working 
out of the foundations of relativistic quantum mechanics, 
the beginning of quantum electrodynamics and the unified 
quantum theory of fields, successful applications of quantum- 
mechanical techniques—this all took just several years, 
beginning in the summer of 1925 [164]. 

What follows is a summary of highlights of the major 
developments in quantum theory.! 

Nonrelativistic quantum mechanics started with the dis- 
covery of its mathematical techniques in two forms— 
algebraic and analytical—in accordance with the two paths 
of the development of quantum theory in the previous years 
(see Sec. 5). 

In July 1925 (when Bohr wrote that one should be pre- 
pared for a decisive rejection of earlier concepts, see above) 
Heisenberg? [166], developing, after Bohr, correspondence 
principle’, sought to ‘construct a quantum-theoretical me- 
chanics similar to classical mechanics, such that it would 
only include relationships between observed quantities’ 
(see [166]). He worked out an algebraic technique, which 


1 A more detailed account of these developments is to be found 
in [3, 6] and also in [10]. The latter also contains a chronologically 
arranged references, including more than 200 references to original 
publications on quantum theory during the important period from 
June 1925 to March 1928; see also [165]. Later in the article we only 
refer to the most important of the publications during the formation 
of quantum theory. 

2 About previous works of Heisenberg see in [167]. 

3 About Bohr’s influence on Heisenberg see Heisenberg’s me- 
moirs [168]. 
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enabled him to quantize the harmonic and anharmonic 
oscillators. This was matrix mechanics (that Heisenberg 
introduced matrices without knowing it was indicated by 
Born), and it was further developed by Born, Jordan, and 
Heisenberg [169, 170]. Dirac [171] produced the algebraic 
formalism of the ‘mechanics of g-numbers’. It was at that 
time that first appeared commutation relations [169, 174], 
and Born and Wiener [172] soon for the first time used the 
theory of linear operators by putting into correspondence 
to each physical quantity a Hermitian operator. On the basis 
of matrix mechanics Pauli in 1926 solved successfully 
the problem of the energy levels of the hydrogen atom and 
derived the formula (26) [173]. 

Schrédinger followed another path. He improved upon de 
Broglie’s ideas about the wave properties of microparticles 
and worked out the analytical formalism of quantum me- 
chanics—wave mechanics.* Within half a year he published 
four papers under the general title ‘Quantization as an 
Eigenvalue Problem’ [174]. Even the first of these (January 
1926) contained a wave equation, the stationary Schrédinger 
equation, and its solution for the hydrogen atom, which is 
now provided in all textbooks on quantum mechanics 
(and much simpler than Pauli’s solution [173] using matrix 
technique). In the second paper Schrödinger used optical 
and mechanical analogies to compare in detail ‘geometrical’ 
and ‘wave’ mechanics. He solved the problems of the oscil- 
lator and the rotator in a natural and elegant way (as the 
problem of the hydrogen atom in the first paper) and his 
results agreed perfectly well with those obtained using 
matrix mechanics. The third paper was devoted to the 
theory of perturbations and applied it fairly successfully 
to Stark’s effect; and in the fourth paper appeared a time- 
dependent wave equation—the nonstationary Schrédinger 
equation. 

Naturally, the questions presented themselves: what is 
the connection between the matrix and wave forms of me- 
chanics? What significance should be attached to the new 
mathematical formalism, specifically to the wave func- 
tion p? 


4 For more details see (162, 163]. 
6-01071 
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An answer to the first question was given by Schrédinget 
already in March 1926 [175] (and independently by Pauli, 
who did not publish his results [176] and by Eckart [177, 
178]). Schrödinger demonstrated that matrix and wave 
mechanics are mathematically identical. He used the theory 
of linear operators (independently of Born and Wiener, see 
above) and proved that from the operator form of the wave 
equation one can obtain matrix mechanics, and matrix 
elements are found using wave functions in the now familiar 
way. It thus became clear that matrix mechanics and wave 
mechanics are two mathematical forms of the same general 
theory, quantum mechanics as we call it now. 

The most significant question of the physical meaning 
of the wave function was solved by Born in his paper on 
the quantum mechanics of collisions [179].° Drawing on 
Einstein’s ideas about probabilistic connection of the wave 
field with light quanta (see above), Born gave a probabilistic 
interpretation of the wave function and its expansion coef- 
ficients. This interpretation played a great role in the 
further development of quantum mechanics—it became 
the basis for a correct physical interpretation when the 
general mathematical tools of quantum mechanics as a 
unified and consistent theory of atomic systems were de- 
veloped. 

A general formulation of nonrelativistic quantum mechan- 
ics was given in December 1926 by Dirac [181] and Jordan 
[182] independently (as it occurred in many cases during 
this period of vehement growth of quantum theory). In 
these publications transformation theory was laid down. 
Dirac’s version was a very elegant one; specifically, he 
introduced his famous 6-function (Dirac’s 6-function). Both 
Dirac and Jordan in their physical interpretations of the 
mathematics started with Born’s probabilistic interpretation 
(note that Jordan [182], referring to Pauli [183], treats 
transformation functions as probability amplitudes). The 
mathematics of quantum mechanics was further refined in 
April 1927 by Hilbert, Neumann, and Nordheim [184], who 


5 Born’s preliminary report, dated 25 June 1926, was published 
by Zeitschrift für Physik after the Naturwissenschaften journal declined 
it ‘for want of space’ (!), although Born wrote in the notes: ‘I believe 
that its publication here will not seem unnecessary’ [180]. 


Quantum Theory. Origins and Growth 83 


applied the general theory of linear operators in Hilbert 
space. Neumann then published some papers [185-187]. 
In the last one he considered mixed ensembles. Later Neu- 
mann summarized his results in the munograph [188]. 

Concluding breakthroughs in nonrelativistic quantum 
mechanics were the discovery by Heisenberg of the uncer- 
tainty principle [189] and the formulation by Bohr of the 
complementarity principle [190]. 

Heisenberg in March 1927 formulated his uncertainty 
relations for coordinate and momentum, and also for energy 
and time [189]; he gave a thorough mathematical treat- 
ment of these relations on the basis of the general Dirac- 
Jordan theory, and examined in detail a number of thought 
experiments, including the one with an atomic beam passing 
through two inhomogeneous magnetic fields. Heisenberg 
discussed his results with Bohr and took notice of the latter's 
comment. Physicists were quick to appreciate the importance 
of Heisenberg’s contribution, and the uncertainty relations 
for coordinate and momentum became a cornerstone of 
quantum theory. It reflects the physical picture of micro- 
scopic phenomena for which classical concepts are only of 
limited use. 

A fairly profound review of the fundamental issues of 
quantum theory was given by Bohr in his report [190], 
in which he put forward the concept of complementarity. 
The report was delivered by Bohr at the Como International 
Congress of Physicists, Italy, 16 September 1927, and pro- 
voked much discussion, which was carried on in October 
1927 at the Fifth Solvay Congress. And there started the 
famous dispute of Bohr with Einstein, which was to last 
for nearly 30 years, up to Einstein’s death.® 

Quantum mechanics stimulated the development of quan- 
tum statistics, two types of which were discovered by Bose 
and Fermi (see above). Another significant development was 
relating the quantum statistics of the two types with the quantum 
mechanics of systems consisting of identical particles. Heisen- 
berg in June 1926 [194] and independently Dirac in August 
1926 [195] showed that states of a system of identical in- 
distinguishable particles come as symmetrical and antisym- 


€ For more details about this discussion see [191]. The issues of 
the complementarity principle are still widely discussed [192, 193]. 


6* 
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metrical under interchange of two such particles. The case 
of symmetrical states corresponds to the Bose-Einstein sta- 
tistics, and the case of antisymmetrical states, to the statistics 
proposed by Fermi (and obtained also by Heisenberg and 
Dirac, who were unaware of Fermi’s results [196], concerned 
with the quantization of the ideal monoatomic gas, published 
early in 1926). Dirac dealt at length with the statistics of 
antisymmetrical states, and it soon came to be known as 
the Fermi-Dirac statistics. It was found later [197] that the 
Bose-Einstein statistics is valid for particles with integral 
spin (including zero) and the Fermi-Dirac statistics for 
particles with half-integral spin (see [108]), and in 1941 
this was substantiated by Pauli [198]. Quantum statistics 
thus turned out to be closely linked with the properties of 
microparticles. 

Also in close connection with quantum mechanics first 
strides were being made by quantum electrodynamics. 
The foundations of the quantum theory of electromagnetic 
radiation were laid by Dirac in his fundamental paper [199] 
dated 2 February 1927. He succeeded in finding the Hamil- 
tonian for a combined system consisting of an atom and 
a radiation field. For the first time he applied second quanti- 
zation to radiation proceeding from commutation relations 
for dynamic variables describing a free radiation field, 
the description being the same for spontaneous and induced 
emissions. Then Dirac also considered [200] some issues of 
the theory of dispersion. Dirac’s papers on radiation theory 
organically combine corpuscular and wave concepts, and 
these works mark the beginning of quantum electrodynamics 
and in general of the quantum theory of fields (see [108]). 
An important advance for quantum electrodynamics were 
the results of Jordan and Pauli [201] and later publications 
of Heisenberg and Pauli [202]. Fermi’s papers [203, 204] 
and his 1930 lecture on the quantum theory of radiation 
[205] are also worth noting. Second quantization was applied 
to microparticle systems obeying the Bose-Hinstein statistics 
by Jordan and Klein [206], and to microparticle systems 
obeying the Fermi-Dirac statistics by Jordan [207] and 
Jordan and Wigner [208]. 

Of fundamental importance were Dirac’s works on rela- 
tivistic quantum mechanics published early in 1928 [209]. 
In them he obtained and examined the relativistically invari- 
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ant wave equation for the electron—the famous Dirac equation. 
Dirac showed that his equation suggests that there exist 
the electron spin and spin-orbit interaction. It also correctly 
describes the fine structure of energy levels of the hydrogen 
atom. The immense importance of this equation became 
obvious at a later date, when it was first shown (also by 
Dirac himself in his theory of ‘holes’) that it suggests the 
possibility of the existence of antiparticles, and of the 
creation and formation of pairs. And then antiparticles 
were really discovered (beginning with the discovery of 
the positron in 1932). 

Even when quantum mechanics was being established (the 
end of the initial phase was the derivation by Dirac of 
his equation) there were a wide variety of applications of 
quantum theory to phenomena at the atomic and molecular 
level—in the theory of atoms and molecules, including 
quantum chemistry, in the theory of collisions and Compton 
effect and photoelectric effect. Also noteworthy are the 
development and wide applications of approximate methods, 
such as perturbation methods and variational methods. 
Answers were given to the important questions of interrela- 
tion between classical and quantum mechanics. As early as 
1926 was developed the quasiclassical method, suggested 
independently in [210-212] and generally known as the 
Wentzel-Kramers-Brillouin method (or the WKB method 
for short). This method is based on representing the wave 
function p in the form wp = exp (2niS/h), where S is the 
action function, and on expanding S into powers of h. This 
made it possible to obtain in a first approximation the 
quantum conditions (29). 

An especially graphic illustration of the connection be- 
tween quantum mechanics and classical mechanics was 
obtained by examining the time dependence of the average 
values of physical quantities. In September 1927 Ehrenfest 
[213] showed that average values are described by equations 
of motion similar in form to classical ones (the Ehrenfest 
theorem). 

As quantum theory struck roots the symmetry of quantum 
systems (both spatial and commutational) turned out to be 
of importance (the latter plays a great role in quantum 
statistics, see above). Fundamental results in this respects 
were obtained by Wigner in 1926 [214] and by Weyl in 


86 M. A. Elyashevich 


1927 [215], who used group theory. The latter paper was the 
core of Weyl’s well-known book [216]. 

Applications of quantum theory to atomic systems, in- 
cluding the account of their symmetry, turned out to be 
quite efficient in explaining the patterns established by the 
atomistics of the 19th century (see Sec. 2). A foundation 
was thus formed for a widespread application of quantum 
methods not only in physics but also in related sciences. 
Of especial significance for chemistry is the emergence of 
quantum chemistry, and for crystallography, of the quantum 
physics of the solid state. This is a manifestation of the 
interaction of sciences, as noted in Introduction. 

It should be emphasized in conclusion that the funda- 
mental laws governing the microworld, which were dis- 
covered in the early days of quantum theory, i.e. from the 
middle to the end of 1920s, paved the way for the growth 
of the physics of the nucleus and the physics of elementary 
particles and high energies. 
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The Origins and Development 
of Modern Atomistics 


D. N. Trifonov 


‘If, in some cataclysm, all of scientific knowledge were to 
be destroyed, and only one sentence passed on to the next 
generations of creatures, what statement would contain the 
most information in the fewest words? I believe it is the 
atomic hypothesis (or the atomic fact, or whatever you wish 
to call it) that all things are made of atoms—little particles 
that move around in perpetual motion, attracting each other 
when they are a little distance apart, but repelling upon being 
squeezed into one another’ [1]. 

This statement was made by the prominent physicist 
R. Feynman. And this is scarcely an exaggeration. 

Atomistics—the teaching on atoms (and on molecules, if 
viewed in a wider context)— is rightly regarded by historians 
of science as one of the most important scientific concepts 
that predetermined the vehement growth of natural sciences 
in the 19th century. But whatever its enormous significance, 
the atomistics of the last century was limited in many re- 
spects, and its precepts were no more than hypotheses. Many 
fundamental discoveries were not rigorously justified in 
terms of the then prevalent atomistic notions, e.g. the 
periodic law and the periodic system of the chemical ele- 
ments of D. Mendeleev. The latter noted in 1889: ‘The 
periodic variation of simple and complex bodies is governed 
by some supreme law, whose nature, even less whose cause, 
cannot yet be understood. In all probability it lies at the 
root of the inner mechanics of atoms and particles’ [2]. 

In fact, the reasons behind the phenomenon of periodicity 
could not be understood until the most important ideas of 
atomistics were delineated in a more specific way. But in 
the first place, there was no clear notion of how the atom 
was structured. 
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l. Concepts of Atoms at the End 
of the 19th Century 


Mendeleev in 1892 gave a pithy characteristic of that period 
in the entry ‘Matter’ in Brokhaus and Efron’s Encyclopaedic 
Dictionary [3]. ‘The currently recognized atoms of chemical 
elements are the frontiers of analysis reached by the thought 
of natural scientists.’ He then went on to list the main 
known facts about atoms: 

(1) ‘chemical atoms of each element are invariable’ and 
there exist as many atoms as there are known chemical 
elements (at the time about 70); 

(2) atoms of a given element are the same; 

(3) atoms have weight, and ‘the difference of elements 
from one another reduces at the present state of our know- 
ledge to the unquestionable difference in (relative) weight 
of atoms of various elements’; 

(4) mutual conversions of atoms of a given element into 
atoms of another element are impossible. 

In summary, Mendeleev concluded: ‘Atoms of chemical 
elements remain unknown in essence and constitute just 
a hypothesis.’ He noted that nothing can be said about the 
structure of atoms: ‘Whether these are bodies of a certain 
solid form, for example spherical or whatever, whether these 
are vortex rings, whether they are divisible geometrically, 
whether these are dynamic systems or not—this all is yet 
beyond the present-day inductive techniques of research.’ 
Little can be added to this Mendelcev’s characteristic, which 
is associated with issues of atomistics unsolved at the time. 

The first question was: How many in general are there 
species of atoms? Since the number of species was considered 
to equal the number of chemical elements, the question 
clearly reduced to what finite number of elements occur 
in natural objects. W. Crookes in 1886 reflected about this: 
‘We look at the number of elements, their relative properties 
and ask ourselves: Are these circumstances random or con- 
ditioned by anything? In other words, could be there only 7, 
or 700, or 7000 absolutely different elements instead of 
those 70 (to round off) which we usually recognize now? The 
number is not justified for our mind by any a priori or 
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external considerations’ [4]. Consequently the uncertainty 
of the number of atomic species was another of the features 
of the alomistics at the end of the 19th century. 

Whether atomic weights are integers or not was another 
unsolved problem. Integral atomic weights had been postu- 
lated ever since the Prout hypothesis was made. But measure- 
ments, whose accuracy grew with time, suggested that the 
atomic weights are as a rule not integers. Disputes over the 
Prout hypothesis figured prominently in the history of 
atomistics in the 19th century, but they remained unresolved. 
At the same time nonintegral atomic weights could be 
explained from two assumptions: (1) to select as a ‘unit’ 
of mass not the integral mass of the hydrogen atom (Prout) 
but a fractional quantity of its mass or the mass of the atom 
of some other element (e.g. the hypothetical element helium, 
which was only found on Earth in 1895), (2) to recognize 
that each element may have species of atoms that differ in 
mass (integral). Their different abundances should be re- 
sponsible for the nonintegral atomic weight of an element 
as a whole. Crookes was a proponent of the second assump- 
tion. He proposed to replace the notion of ‘element’ by 
the notion of ‘elemental group’ (as a multitude of the atomic 
species) [5]. Historians of science justly view Crookes’s 
suggestion as anticipating the discovery of isotopy [6]. 
But this opinion turned out to be manifestly untimely. 

To sum up, although the notion of the atom was generally 
recognized and without it any development of natural sci- 
ences would be impossible, in essence it still remained just a 
hypothesis. The concept of atoms was still to shed its ‘struc- 
tureless clothes’ and start a new life in the guise of a specific, 
scientifically justified model—and thereby to start modern 
atomistics. 

The construction of such a model was one of the greatest 
sequels of a revolution of natural sciences that occurred 
in the closing years of the last century. The beginning of this 
revolution is customarily linked with the discovery of 
X-rays, radioactivity, and the electron. With good reason, 
one could add to this list the discovery of inert gases, which 
had a noticeable role to play in the development of the 
theory of atomic structure and in relating it to the properties 
of chemical elements. Lastly, Planck’s theory of quanta 
was also a notable advance. 
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We will now outline the chronology of these fundamental 
discoveries in physics.! 

X-rays were discovered by the German physicist W. Rönt- 
gen on 8 November 1895 in his laboratory in Wiirtzburg. 

The emission by uranium compounds of a new kind of 
radiation, which later (1898) was termed radioactivity, was 
discovered on 1 March 1896 by the French physicist A. Bec- 
querel in the Laboratory of Applied Physics of the Paris 
National Museum of Natural History. 

The credit for the discovery of the electron shared the 
English physicist J. J. Thomson (April 1897, the Cavendish 
Laboratory) and the German physicist E. Wiechert (although 
usually this discovery is ascribed to Thomson only). 

Lastly, the discovery of inert gases was essentially an 
achievement of the English physicist and chemist W. Ram- 
say. Argon was discovered on 3 August 1894, helium on 
25-25 March 1895, krypton on 31 May 1898, neon on 7 June 
1898, xenon on 12 June 1898. 


2. Details of Fundamental Discoveries 
in Physics 


These discoveries have gone down in history as evidence of 
an unparalleled elevation of scientific thought, an upheaval 
that occurred during an extremely short period of time. 
The history of many scientific disciplines knows periods of 
unusually high concentration of spectacular advances. For 
example, for nuclear physics such was the year 1932, dubbed 
annus mirabilis. And the period from 1894 to 1898 
we might call ‘the five years of wonders in the evolution of 
atomistics’. 

This description seems to be well deserved, and is hardly 
in need of dwelling on. What is of value is an examination 
of certain specific features of these discoveries. 

First, they were all results of experimental studies, not 
consequences of theoretical speculations. On the contrary, 


1 In this article we, as a rule, will not for space constraints give 
specific references to individual discoveries. For these references the 
reader is referred to the review article [7]. 
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once there, they became cornerstones of theories, which 
eventually led to cardinal revisions of views of the structure 
and properties of matter. 

Second, these discoveries can be regarded as opportune, 
since there existed premises for their realization. 

Third, the discoveries of X-rays, radioactivity and the 
electron are to a large degree mutually conditioned and con- 
nected. Therefore, an analysis of their history in isolation 
would be not only logically and methodologically incorrect, 
but actually impossible. Only the discovery of inert gases 
seems to stand alone; but its consequences are closely linked 
with the results of investigations into new kinds of radi- 
ation and the evolution of electron concepts. 

Experimental capabilities behind the discoveries of X- 
rays, radioactivity and the electron emerged in the wake 
of two directions of experimental work: cathode rays and 
phosphorescence. As is known Röntgen discovered X-rays by 
observing phosphorescence of a sheet of cardboard impreg- 
nated with barium platinocyanide located near a gas- 
discharge tube. Radioactivity was discovered when A. Bec- 
querel decided to test the hypothesis (false) that phosphores- 
cent bodies emit X-rays; in the process a new kind of radi- 
ation was discovered that was given off by uranium salts. 
The electron was discovered when an attempt was made to 
prove definitively the corpuscular nature of cathode rays 
and to measure the value of e/m for them. 

Inert gases came as a surprise for the following reasons. 
First, the composition of the terrestrial atmosphere was 
believed to be investigated so thoroughly that it seemed 
highly unlikely that there might be some unknown com- 
ponents in it; second, the existence of gaseous chemically 
inactive elements was generally unpredictable from the 
periodic system, as it was stressed repeatedly by Mendeleev 
himself; third, precisely the chemical inertness of argon- 
type gases was in sharp contrast with the then common 
views of the ability of some chemical elements to interact 
with others. But one can see a regularity in the history of 
the discovery of inert gases. After all, the starting point 
of finding argon was the experimentally established fact 
of the difference in densities of the atmospheric nitrogen 
and the nitrogen produced from chemical compounds. The 
advances of spectral analysis and the technology of lique- 
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faction of gases turned out to be important premises for the 
isolation of helium, neon, krypton, and xenon. 

It would be of interest to look at the situation that devel- 
oped in physics by the mid-1890s and that influenced the 
apprehension of the discoveries and their interpretation. 
An important subject of physical exploration were observa- 
tions of phenomena occurring in gas-discharge tubes (cathode 
and canal rays), and attempts to justify the corpuscular 
nature of electricity. Such a ‘background’ was behind the 
enormous interest in X-rays, especially in connection with 
their specific features. Amidst that ‘ray boom’ the discovery 
of ‘uranium’ rays, by contrast, attracted little attention; 
also their nature was initially interpreted incorrectly by 
Becquerel himself (an emission of invisible rays by phos- 
phorescent bodies). It was not until some time later (1898) 
that M. Curie put forward the assumption that radioactivity 
is a property of the atom. The discovery of radioactivity in 
thorium, the discovery of polonium and radium provided 
grounds for considering this phenomenon as inherent in the 
atoms of the heaviest (in atomic weights) elements. This was 
a welcome addition to the atomistic thinking: the 
ability of atoms of large mass to emit rays of a ceftain na- 
ture. 

As early as 1891 the Irish scientist G. Stoney coined 
the name ‘electron’ for defining the elementary electric 
charge, thus recognizing the corpuscular nature of electri- 
city. The electric ‘corpuscle’ turned up in flesh and blood 
in the work of J. J. Thomson, who calculated its mass to be 
1/1845 of that of the hydrogen atom. But in his speculations 
the scientist went even further: cathode rays are a new 
state of matter that differs by its nature from the gaseous 
state ...; in that new state matter is a new substance from 
which all the chemical elements are constructed [8]. Accord- 
ing to Thomson, electrons appeared to be a sort of building 
blocks in atomic structures. This conclusion dealt a blow on 
classical atomistics in ils most conservative part: the atom 
consists of smaller fragments, it is divisible, i.e. not un- 
changed. On the other hand, Thomson’s idea as it were 
breathed new life—at a new level—into the Prout hypothesis 
about ‘protil’. Thomson then asked the question of why the 
atoms of one element differ from those of another. He thought 
that the difference resides in the different numbers of elcc- 
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trons and their ‘arrangement’. This aspect will become 
a subject of our discussion later. 

We have already mentioned that the discovery of inert 
gases was a surprise. They soon gained recognition as inde- 
pendent chemical elements and were placed in the periodic 
system, comprising a group of their own, the zero group. 
But they defied all attempts of chemists to study them, 
since they were unable to enter into chemical compounds 
with other elements. Their atoms were ‘on their own’— 
even their molecules were monoatomic, unlike other gaseous 
elements. Atomistic thinking was thus enriched with another 
piece of evidence: the atoms of the elements belonging to 
a certain group (the inert gases of the atmosphere) cannot be 
involved in chemical interactions. This thesis was only 
partially disproved in the early 1960s by synthesizing chemi- 
cal compounds of xenon, krypton, and radon. 


3. Main Paths of Modern Atomistics 


In this section we will attempt to analyze the ways in which 
the implications of the fundamental discoveries in physics 
prompted the refinement of modern atomistics. A detailed 
analysis of this issue would require a book, though. There- 
fore, we will here only give some highlights, the three main 
lines of evolution of modern atomistics: (1) the construction 
of a model of the atom; (2) evidence for changeability of 
atoms; (3) evidence for the existence of species of atoms of 
chemical elements. 


3.1 The Construction of a Model of the Atom 


The proof of the existence of the electron—the tiniest par- 
ticle that carries a negative electric charge—brought a 
measure of realism into probing into the structure of the 
atom. Ever since the closing years of the 19th century the 
atom, to coil a phrase, was trying on one ‘electron attire’ 
after another. But to stick to some satisfactory model 
researchers were lacking one important detail: the idea of 
the carrier of positive charge to neutralize the negative 
charges of electrons. The then current views provided no 
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answer to the question. Nevertheless, electron models of 
the atom kept appearing one after another. Their list in 
chronological order is as follows: (1) the Kelvin model 
(1902)—the electrons are arranged in a certain manner with- 
in a positively charged sphere; (2) the F. Lenard model 
(1903) —the atom consists of ‘doublets’ of negative and 
positive charges (the so-called dynamids), (3) the H. Na- 
gaoka model (1904)—the atom is organized just like the 
planet Saturn (a positively charged body is surrounded by 
rings consisting of negatively charged electrons), (4) the 
J.J. Thomson model—within a positively charged sphere 
spinning electrons are arranged in one plane on concentric 
shells that contain different but finite numbers of electrons. 
These models were a result of theoretical (often purely 
mathematical) constructs and were formal in character, 
because they failed to correlate the structure of atoms 
with the properties of elements. An exception was J. Thom- 
son’s model. He undertook the first attempts at an explana- 
tion of the periodic variation of the properties of chemical 
elements by linking the periodicity with the regular vari- 
ation of the number of electrons in concentric ‘rings’, or 
to use his terminology in ‘corpuscular groups’. Discussing 
the various configurations of ‘rings’ in his model Thomson 
indicated that the stability of these configurations is period- 
ically related to the number of electrons. Put another way, 
Thomson’s views harboured the extremely important con- 
clusion that the place of an element in the periodic system 
is determined by the arrangement of electrons in the atom, 
which in the final analysis was not far from truth. 

But the exact numbers of electrons in atoms were yet 
unknown. Considering that the electron mass was insignifi- 
cant, one might suppose that they were fairly large, but 
then clearly the question of the mass of the carrier of positive 
electricity presented itself. Thomson believed that the 
carrier of a unit positive charge is far larger in mass than 
the carrier of a unit negative charge, and this too turned 
out to be the case. 

The electron fairly quickly exhausted its potentialities 
as the only ‘building block’ of atoms; but all the atomic 
models listed above had had their part in setting the scene 
for the future planetary model of the atom. Nearly each 
of them contained a measure of reality, even the Lenard 
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model (since estimates made from it suggested that most 
of the volume occupied by the atom is emptiness), and even 
the Nagaoka model (the ‘Saturnian’ atom). 

The Rutherford model became possible because of new 
evidence that came from radioactivity studies, not so much 
of the phenomenon itself as the action of the alpha-particles 
produced in a radioactive decay on various substances. 
It was an analysis of the scattering of alpha-particles by 
various materials that enabled Rutherford in 1911 to sug- 
gest his idea that the atom has a massive charged body, 
the nucleus (the term was introduced by Rutherford in 
1912). 

Attempts to determine the number of electrons in the 
atom and the charge of its nucleus at first gave rise to the 
idea that for light elements the charge Z was about half 
the atomic weight. An important part was also played by 
A. Van den Broek’s hypothesis (1913): the nuclear charge 
is equal to the atomic number of a corresponding element 
in the periodic system. It won general acceptance immedi- 
ately. Van den Broek also suggested that the nucleus con- 
sists of protons and electrons. Both assumptions were used 
by F. Soddy, who introduced the notion of isotopy (for 
tadioactive elements) and promoted the displacement rule 
to the*status of law (see below). Better understanding of the 
structure of the atom had implications for the two other 
lines of evolution of modern atomistics: evidence for change- 
ability of atoms and evidence for the existence of spccies 
of atoms of elements. Van den Broek’s thesis linked directly 
the Rutherford model with the periodic system. It was con- 
firmed experimentally by H. Moseley (1913-14), who examined 
patterns of characteristic X-ray spectra. It was here that 
X-ray studies contributed to the development of novel 
atomistics. 

So atomistics was enriched with several fundamental 
points: (1) the atom with the smallest nuclear charge ap- 
peared to be the hydrogen atom (the lower boundary of the 
periodic system was thus established), (2) the number of 
elements from hydrogen to uranium is limited; (3) there 
are ‘gaps’ in the sequence of elements, which correspond 
to yet undiscovered elements (with Z = 43, 61, 72, 75). 
Moseley attached a physical meaning to the periodicity 
law with the result that its formulation was changed (the 
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properties of elements were now stated to vary with Z, 
not with the atomic weight). 

N. Bohr (1913) applied quantum theory to the Rutherford 
model to account for its conflict with classical electrodynam- 
ics. And so Bohr’s interpretation of the Rutherford model 
became the key concept of new atomistics. Bohr was also 
instrumental in developing the principles behind the elec- 
tron shell configurations of atoms. This made it possible to 
account for the physics of the periodic variation of the 
properties of elements and to advance a formal theory of 
the periodic system. 

Throughout nearly two decades reigned the proton-elec- 
tron model of the nucleus. Wrong in its essence, it neverthe- 
less was no hindrance to the wide dissemination and use 
of the classical atomic model as a whole. The chemical, 
optical and X-ray characteristics of elements were in the 
final analysis unaffected by the details of the structure of the 
nucleus. Even isotopy in stable elements was discovered 
and studied in accordance with the proton-electron nucleus. 
Only when J. Chadwick discovered in 1932 the neutron 
came the present-day concepts of the proton-neutron nucleus. 

In summary, a consequence of the fundamental physical 
discoveries at the end of the 19th century was the con- 
struction of the model of the atom as a whole. Instead of 
the ‘structureless’ atom came a new atom as a complex 
system of elementary particles. 


3.2 Evidence for the Variability of Atoms 


Two aspects are to be distinguished here: (1) proof of spon- 
taneous transformability of atoms (elements) and (2) proof 
of artificial transformations of atoms. 

It is normally maintained that the proof of transforma- 
bility of elements as a result of natural radioactive decay 
was essentially a defeat of the concept of the indivisible, 
invariable atom. But all depends, however, on what is read 
into these terms. 

When J. J. Thomson envisaged electrons as a certain pri- 
mary matter of which atoms of all elements are made up, it 
followed that the very existence of electrons suggested that 
the atom is divisible into lighter constituent parts. Viewed 
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in that sense the Thomson idea deprived the term ‘atom’ 
(indivisible) of its meaning, because by disintegrating into 
electrons it would lose its individuality. The picture looks 
different if we approach the divisibility from the point 
of view of the nuclear model. We can keep stripping the 
atom of one electron after another and thus obtain ever 
more strongly charged ions. Or the atom may even lose all 
of its electrons, but this would not change the nucleus, 
which retains a certain charge, and which is eventually 
responsible for the properties of the given atom (element). 
Therefore, the fact that the atom can lose some electrons 
implies to a certain degree the ‘mechanical’ divisibility of 
the atom, i.e. separation of the atom’s components without 
its losing its key entity, the nuclear charge. It is the value 
of Z that in a way determines the electron configuration of 
the atom. Only when the nucleus undergoes some changes, 
can we speak of real variability of the atom as a material 
system as a whole. 

Historically, atomic transformabilities were established 
at a time when the current conceptions of atomic models did 
not yet warrant any definite conclusions concerning the 
mechanism of the transformabilities, and so the latter did 
not get any rigorous physical interpretation. Transforma- 
bilities were judged from chemical and spectroscopic obser- 
vations, i.e. from observations that, in modern thinking, 
are only associated with changes in electron shells of atoms. 
But in reafity here occurred nuclear transformations. 

The idea that the atom is transformable emerged in 
actual fact with the assumption that radioactivity is a 
property of the atom. M. Curie wrote in July 1900: ‘The 
state of radioactive matter is not a conventional chemical 
state, its atoms are unstable, they emit particles smaller 
in size than the atoms. The atom, indivisible from the 
chemical viewpoint, is here divisible ... Consequently, radio- 
active matter undergoes chemical transformations, which 
serve as a source of the emitted energy, but these are not 
just common chemical transformations, since the latter 
leave the atom unchanged...’ [9]. 

A thought formulated in such definite tern.s was in need 
of an experimental support, which arrived soon. Decisive 
experiments were largely made possible by the discovery of 
radioactive emanations (thoron and radon), which turned 
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out to be inert gaseous substances. The latter property was 
established because there was already an example of chemic- 
al inertness in argon and its analogues discovered several 
years previously. If the latter had not been discovered by 
that time, an understanding of the true nature of emanations 
would have been difficult and interpretations might have 
taken some wild turn. As we are to see later, the discovery 
of emanations also helped to prove that there may exist 
species of atoms of a given element. 

Further developments are well known. At first E. Ruther- 
ford and F. Soddy in 1902 concluded that there occur natural 
transmutations of elements, and W. Ramsay and F. Soddy 
in 1903 proved, using spectral analysis, that radioactive 
decay of radon produces helium. 

In subsequent years data on natural transformabilities 
of atoms of chemical elements were coming from areas hav- 
ing nothing to do with atomic models. The accumulation 
of information went on in accordance with the current con- 
cepts of atomistics, because it was of importance to estab- 
lish the atomic weight of transformation products. There 
seem to have been about equal contributions of physics 
and chemistry to solving the problem of natural transfor- 
mabilities of elements. Physics took care of the nature of 
alpha-particles (doubly ionized helium atoms) and identified 
beta-particles as electrons. Also the individuality of new 
radioactive substances was established from their radio- 
active characteristics (decay type, half-life) or from their 
spectra. But it was impossible to achieve a final identi- 
fication without an insight into the chemical features of 
an element, although many radioactive substances are so 
active that their chemical nature can only be estimated in 
an indirect manner. Clearly, a ‘matrix’ of sorts used as 
a reference for successive radioactive decays and as a basis 
for conclusions about the nature of resultant substances 
was the periodic system. 

The observed natural radioactive transformations were 
not just some scattered, isolated events, they were rather 
linked to form definite sequences—‘chains’, which are also 
known as radioactive series or families (uranium, thorium 
and actinium families). This brought out the clearly observed 
genetic relation of atoms, which within the framework of 
earlier atomistics had been just a product of hypothetical 
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constructs. This relation was understood and explained in 
comparison with atom model-building and proving the 
existence of atomic species (this is specifically manifested 
in the formulation of displacement law). 

The ‘formation’ of radioactive series was generally com- 
pleted by 1918 (the discovery of protactinium); later only 
one refinement was made—the parent was established for 
the actinium family in 1934 (uranium-235). 

For decay processes the term ‘transformability’ is more 
legitimate, because atoms change spontaneously in nature. 
If a change is artificial, the term ‘transformation’ is more 
suitable, because changes in that case are artificial, they 
are brought about by external interventions. Establishment 
of ‘transformation’ is not directly linked with the proof of 
‘transformability’, but it should be noted that at the turn 
of the 1910s and 1920s there emerged many ideas to ‘pro- 
long’ radioactive series into the region of lighter elements 
of the periodic system. These ideas were partly caused by 
the so-called hypothesis of universal radioactivity that 
stated that all elements are radioactive, but most of them 
only slightly so, whereby their activity cannot be observed 
or can be observed with great difficulty. For the nascent 
new atomistics it was quite tempting to establish a genetic 
relation between all the elements. But these views appeared 
to be unsound; their only positive result was the prediction 
of the existence of atomic species in stable elements. 

Artificial transformations were discovered from insignifi- 
cant fundamental and experimental evidence. Historically, 
one could only mention individual ideas voiced by Ruther- 
ford in April 1914. The starting experimental material in 
this respect came from Rutherford’s co-workers E. Marsden 
and V. Lentsberry. When probing into the interaction of 
alpha-particles with hydrogenous substances the researchers 
found anomalous long-range H-particles. This led them to 
conclude: along with alpha-particles, H-particles are ejected 
by radioactive atoms themselves. In the course of his own 
experiments to support or disprove the hypothesis Ruther- 
ford in April 1919 observed a sharp increase in the number 
of H-particles when nitrogen was irradiated with alpha- 
particles. He assumed that under the action of alpha-parti- 
cles the nitrogen atom undergoes a change accompanied 
with the ejection of H-particles. But the true mechanism of 
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this first artificial transformation—'!N(a, p)';O— was final- 
ly identified by P. Blackett only in 1925. And it is this 
date that concludes the chronology of the discovery of 
artificial disintegration of nuclei. 

An (a, p)-type nuclear reaction was accomplished on sev- 
eral light elements down to potassium. For atoms with 
Z >19 to transform artifıcially, the energy of natural 
alpha-particles was too small. And so the new atomistics was 
enriched by a new thesis: the atoms of light stable elements 
can artificially be transmuted into other atoms with Z larger 
than the original value by one. 

Investigations into artificial transformation processes were 
stimulated by the discovery of the neutron. Also of impor- 
tance was the discovery of so-called artificial radioactivity 
(1934). It was in fact of no less importance than the dis- 
covery of natural radioactivity and it enriched appreciably 
the contents of modern atomistics: the possibility was proved 
of obtaining unstable atoms with any Z, thereby the sense 
of the term ‘transformability’ became wider.’ 

To achieve a quantitative characteristic of natural trans- 
formability of atoms time factor was introduced into atomi- 
stics through the concepts of ‘half-life’ and ‘mean lifetime’, 
or ‘lifetime’. It turned out that lifetimes of various radio- 
active elements range from millionth of a second to tens of 
billions of years. Artificial radioactive isotopes of stable 
and natural radioactive elements are also characterized by 
a wide range of half-lives. The probabilistic nature of 
radioactive decay was first predicted by Rutherford and 
borne out by E. Schweidler in 1905; it became the corner- 
stone in interpretation of natural transformability of atoms. 
But we are unable so far to account for the ‘intimate’ reasons 
why a given atom of a radioactive element decays at a cer- 
tain moment of time, whereas another atom of the same 
element, absolutely identical, remains unchanged much 
longer. Therefore, ‘variability’ of atoms, be it atoms of 
naturally occurring radioactive elements or artificial radio- 
active isotopes, is governed by chance. 


1 It is worth noting that the division of radioactivity into ‘natural’ 
and ‘artificial’ is hardly legitimate, because these two kinds of radio- 
activity are realized through identical mechanisms, whether viewed 
aS a process or as a property (see below). 
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3.3 Evidence for the Existence of Atomic Species 


The idea that elements have atomic species of different 
masses was discussed back in the 19th century [6]. But for 
a long time there had been no theoretical and experimental 
evidence to support this idea. 

Radioactivity studies provided much evidence for this. 
An avalanche of discoveries of new radioactive substances 
brought to a head the need to understand their true nature. 
There was no consensus [6]. Some (e.g. M. and P. Curie) 
considered that all radioactive bodies are elements in the 
common sense of the term. But then the periodic system ap- 
peared to be too ‘rigid’ a structure for them to be satis- 
factorily accommodated. Others (Rutherford and Soddy), in 
contrast, treated short-lived radioactive bodies as special 
material formations, which have nothing to do with true 
elements, and so they referred to them as ‘metabolons’. The 
name did not stick and soon others came on the scene, 
e.g. ‘radioelements’, but their essence again could not be in- 
terpreted unequivocally. The number of radioelements snow- 
balled; it was shown that they are genetically interrelated 
in some way or other (radioactive series), but still the 
situation remained unclear. 

Research revealed that there are radioelements (1) with 
the same atomic weights but different properties and (2) 
with different atomic weights but chemically inseparable. 
A first example was the discovery of emanations: they are 
all inert gases and for them only one place in the periodic 
table was reserved—in the zero group (under xenon). Other 
combinations of chemically inseparable radioelements were 
found; sometimes they had identical optical spectra. Radio- 
elements belonging to these combinations behaved like 
common elements belonging to the corresponding groups of 
the periodic system. This all warranted the assumption 
that chemically inseparable elements can be placed at one 
cell in the system, in certain groups. 

This assumption undermined the principle that each ele- 
ment was to take up a definite place in the system. At the 
same time it led to the assumption that there existed atomic 
species of a new type. As new insights were gained into the 
structure of radioactive series and changes of the chemical 
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nature of radioelements as a result of æ- or B--decays this 
assumption was gaining ground. The empirical displace- 
ment rule actually proved this. Such was the state of affairs 
by early in 1913. 

At this stage much support for the existence of atomic 
species started to come from constructing models of the 
atom, specifically from the idea that the nuclear charge of 
an atom equals the atomic number of the element in the 
periodic table and also from the proton-electron model of 
the nucleus. These concepts were put to work by Soddy, 
who gave a clear formulation of the idea of the atomic spe- 
cies of radioactive elements: (a) the same algebraic sum 
of positive and negative charges in the nucleus, their arith- 
metic sum being different, yields ‘isotopes’ or ‘isotopic 
elements’ and they occupy one place in the periodic table; 
(b) they are identical chemically and similar physically, 
save for a small number of physical properties that are 
linked to the atomic mass; (c) if that atomic charge is 
changed by unity we shift to the neighboring places in 
the periodic system [10]. 

This led to the conclusion that became a valuable con- 
tribution to the evolution of new atomistics. The heavy 
radioactive elements from the end of the periodic system 
(polonium, radon, radium, actinium, thorium, uranium) 
are composed of atomic species that differ in atomic weights 
and radioactive characteristics. That was when the true 
meaning of the term ‘radioelements’ became clear: radio- 
elements are the isotopes of radioactive elements. A signi- 
ficant feature of the history of the discovery of natural radio- 
active elements (except for thorium and uranium) is, there- 
fore, that their individual isotopes (radioelements) were 
successively identified and then brought together into an 
isotope group when the phenomenon of isotopy was estab- 
lished. Its modern interpretation was the result of the de- 
velopment of the proton-neutron model of the atomic nucleus. 

It seemed quite natural to suppose that isotopy is a uni- 
versal phenomenon, i.e. it also covers stable elements. 
Interestingly, already in September 1913 Soddy noted that 
each known element may be a combination of inseparable 
elements occupying the same place in the periodic system. At 
the same meeting where Soddy pronounced those words 
F. Aston, a pupil of J.J. Thomson, delivered his report 
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‘A New Elementary Constituent of the Atmosphere’. It was 
devoted to the existence of two species of neon [11, p. 403] 

It was the first swallow, more discoveries of stable iso- 
topes were to come. The key figure here was J. J. Thomson. 
Having contributed generously to the evolution of electron 
concepts in atomistics, he also started looking for clues to 
prove the existence of atomic species in stable elements. As 
compared with radioactive elements (with each isotope radio- 
actively ‘labelled’, and hence easily identifiable) the situ- 
ation with stable isotopes was more complicated, because 
the isotopes had to be separated by masses. 

The main credit for the discovery of stable isotopes goes 
to Aston. Historically, the search lasted for more than 
30 years (for some stable elements their isotopic composition 
was only finalized in the 1950s), but the overwhelming ma- 
jority; of stable isotopes were discovered during the 1920s, 
which made it possible to draw some conclusions of sig- 
nificance for modern atomistics. 


4. Main Aspects of Modern Atomistics‘ 


The main points of modern atomistics can be laid down as 
follows: (1) the atom is a complex material structure, it 
represents the tiniest particle of the chemical element; 
(2) each element has atomic species (both naturally occurring 
and synthesized artificially); (3) the atoms of one element 
can transform into the atoms of another element, these proc- 
esses occur either spontaneously or artificially (by various 
nuclear reactions). The three points listed cover most of 
the contents of modern atomistics. 

It is worth noting then that the habitual term ‘atom’ now 
became an anachronism, because the earlier notion that it is 
‘unchangeable’ and ‘indivisible’ghas long been disproved. 
That the atom is divisible is an established fact. The divis- 
ibility is defined not only by the fact that,the atom can 
be ‘dismantled’ into ingredients—the nucleus and the 
electrons—but also by the fact that the individuality of the 


1 We of course are not going to cover here all the issues coming 
under the heading of ‘modern atomistics’, and will only confine our- 
selves to its key aspects concerned with the structure and properties 
of atoms as material structures, 
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atom changes as a result ofa wide variety of nuclear proc- 
esses. The individuality of an atom is determined by the 
composition of its nucleus, i.e. by the number of protons 
and neutrons it contains. If the quantity of either changes, 
this atom loses its individuality and a new atom results, 
with new characteristics. If the number of protons Z 
changes, an atom of another chemical element emerges; and 
hence changes the chemical individuality of the atom, and 
so does its physical ‘image’. The latter changes even when 
the number of neutrons N changes, in which case another 
species of the atom is produced with the same Z. In the 
process, the atom retains its chemical properties. 

In summary, the term ‘atom’ itself as viewed from the 
modern angle does not answer the real state of affairs both 
in terms of its structural complexity and in terms of its 
ability to change. Interestingly, one of the originators of 
modern atomistics, Soddy, contemplated a change of the 
term. In 1949 he wrote that after atomic energy had been 
liberated artificially it would be a misnomer to refer to it 
as atomic energy (‘the energy of the indivisible’) since 
strictly speaking the liberation of atomic energy presupposes 
the division of that indivisible entity. And so he suggested 
the term ‘tomic’ instead of ‘atomic’ [12]. 

Soddy reasoned that the word ‘atom’ stripped of negative 
prefix ‘a’ could put things right. Thereby the teaching on 
atoms would become the teaching on toms. But the scientific 
community is characterized by a measure of conservatism in 
that it has a way of sticking to old terminology, and in 
this case conservatism is justified. 

At the same time any scientific discipline in the course of 
its growth requires from time to time perhaps not a change 
but rather an expansion of its arsenal of concepts, or, to 
use modern terminology, of its conceptual basis. In the 
case of atomistic terminology there is an important example 
to this effect. If we consider or correlate atomic species 
whose composition is characterized by various values of Z 
and N, then the phrase ‘to consider or correlate two isotopes’ 
would, generally speaking, be incorrect, since two atoms with 
different values of Z are not isotopic. It only makes sense 
to speak of isotopes when Z is fixed. But this nuance is 
generally overlooked. Meanwhile back in 1947 T. Kohman 
suggested a way to remove this inconsistency by introducing 
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the term ‘nuclide’, i.e. an atomic species defined by the 
composition of its nucleus [13]. This term became later 
quite common. Nuclide is, essentially, nothing else but 
another name for an atom, but at the same time it unequiv- 
ocally emphasizes the presence in the atom of a decisive 
component—the nucleus (a system of nucleons). Therefore, 
the present-day teaching on atoms could in principle be 
called the teaching on nuclides, and atomistics then would 
become ‘nuclidistics’. 


4.1 The Structure of Atoms and the Electron Level 
of Understanding of Periodicity 


Among the simple material structures of nature the atom is 
in many respects a unique entity. It consists of three kinds 
of the so-called elementary particles, the atomic structure 
as a whole being quite stable (within certain limits, of 
course). Science knows of some other ‘conglomerates’ of 
elementary particles, such as positronium, mesoatoms, 
hyperfragments, but they all are rather ephemeral ‘mutants’, 
and their lifetimes are fractions of a second. 

The constituent particles in the atom come from different 
groups of elementary particles: the proton and the neutron 
are baryons, the electron is a lepton. The electron mass is 
known to be 1/1840 of that of the proton, the neutron mass 
is slightly larger than the proton mass. Accordingly, the 
atom’s mass is largely determined by the mass of its nucleus. 
For each atom the sum Z + N determines the mass number 
of the nuclide (A). The isotopes of a given element differ in 
A, and their nuclear masses are also different and noninte- 
gral. 

In the final analysis the very existence of atoms is a re- 
sult of an interplay of four kinds of fundamental interactions: 
strong, electromagnetic, weak, and gravitational. The 
strong interaction is extremely short-ranged (~10-!3 cm) 
and is independent of charge. It is responsible for the nuclear 
forces and nuclear structure. The main domain of the electro- 
magnetic interaction is the electron configurations of atoms; 
it underlies the principles of their structure and hence of 
their chemism. It is 100 times weaker than the strong inter- 
action, but acts over significantly longer ranges. Therefore, 
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the atom as a whole is larger by far (by 10° times) than its 
nucleus. Weak interaction is specific, it determines some of 
the aspects of instability of nuclei (beta decay). As for 
gravitational interaction, its role in the structure of the 
atom is as yet unclear. 

The discovery of the periodic variation of the properties of 
elements with increasing atomic weight was a milestone in 
the evolution of classical atomistics. But the latter was 
unable to account for it (at least in the absence of an atomic 
model). On the other hand, modern atomistics rests on 
a sound atomic model constructed using quantum mechanics. 

His quantum model of the atom allowed Bohr to relate 
the physico-chemical properties of elements to the structure 
of their atoms and to work out a formal theory of the peri- 
odic system. Other contributors to the refinement of the 
theory were Sommerfeld, Lande, Stoner, Uhlenbeck, Goud- 
smit, Swinne and others. As a result, a description was 
produced for the so-called real scheme of formation of elec- 
tron configurations of atoms with increasing Z. This aspect 
of the evolution of modern atomistics has been covered in 
much detail in the literature, and so we will confine our- 
selves to some general remarks. 

As a matter of fact, our understanding of periodicity has 
now achieved a qualitatively new level, the level of atoms 
(or electrons). It turned out that elemental periodicity, 
reflected in Mendeleev’s periodic system, is explained by 
electron periodicity, i.e. periodic recurrence of similar types 
of outer electron shells of atoms with increasing charge. 
The sequence of filling of electron shells and subshells in 
atoms is in turn associated with the concept of four quan- 
tum numbers, which vary within certain ranges and charac- 
terize the location of individual electrons. From the exclu- 
sion principle, due to Pauli, mathematical relationships were 
obtained for capacities of individual shells and subshells. 
The structure of the periodic system is now accounted for 
as a ‘system of periods’. 

In current notation the real scheme can be written as 
follows: 1s?2s?2p°3s?3 p°4s?3d!°4 p%5s?4d!°5 p%6s?4f45d 6 p§7s25ft4 
6d107p° ... . Its characteristic feature is that only for the 
values of the principal quantum number n = 1, 2, the cor- 
responding shells are filled up completely. At larger n shells 
are filled in a stepwise manner, i.e. before a shell with 
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a given n is completed in atomic configurations appear elec- 
trons corresponding to other values of principal quantum 
numbers. For example, if in the argon atom (Z = 18) 
the configuration 3s?3p® has been realized in the potassium 
atom (Z = 19) appears not the 3d- but 4s-electron. System- 
atic ‘intrusion’ of 3d-electrons begins with scandium (Z = 
21). As a matter of fact, this stepwise variation explains 
the structure of the periodic system at the electron level of 
understanding the periodicity. 

In addition to the real scheme there is the so-called ideal 
scheme of formation of electron configurations, according to 
which the formation of each shell, once started, goes on to the 
end, to completion: 15?2s?2p°3s?3p%3d194s?4p®4d94f4 .... 

In certain intervals of Z (1-18 and 29-36) the configurations 
of atoms appear to be the same whatever the scheme. This 
important detail, generally speaking, is a token of an inad- 
equacy of the present-day formulation of the periodic law, 
because, as we have seen, depending on the scheme employed 
the electron configurations of atoms will be different, and 
only in the minority of cases identical. Therefore, a more 
rigorous interpretation of the formulation of the periodic 
law should also consider the periodic variation of properties 
of elements as a function of the structure of the outer elec- 
tron shells of their atoms. 

Although one encounters in the literature statements that 
the Pauli exclusion principle became a sort of conclusion of 
the evolution of the principles of electron periodicity (i.e the 
theory of the periodic system), the real situation is more 
complicated. In fact, it does not provide the opportunity 
to select between the real and ideal schemes. It is worth 
noting that Bohr, when working on the rules of filling the 
shells and subshells in atoms, in the first place drew on 
a body of empirical evidence (reflected in Mendeleev’s 
table) for the variation of the chemical properties of the 
elements, and also on the evidence from the spectra of their 
atoms. At Bohr’s disposal there were no other data for deduc- 
ing the real sequence of filling. Therefore, the appraisal 
of the formal theory of the periodic system made by Som- 
merfeld in 1924 was correct: ‘By and large, this is a com- 
plete and easy interpretation of chemical facts. Basing our- 
selves on the ample spectral evidence we can rely with great- 
er confidence on the qualitative validity of this theory ... 
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although it will perhaps wait for a rigorous treatment for 
a long time’ [14]. 

The formalism of the theory of the periodic system — 
unquestionably one of the greatest generalizations within 
the framework of new atomistics—consisted also in that 
Bohr’s model itself was a marriage of quantum and classical 
postulates: the presence of ‘allowed’ electron orbits rested 
on quantum considerations, whereas the corresponding calcu- 
lations were made using the tools of classical electrody- 
namics. 

A ‘rigorous treatment’, for which Sommerfeld yearned, is 
generally associated with application of quantum-mechani- 
cal techniques to the problem of atomic structure. It is be- 
lieved, not without reason of course, that we now possess 
a quantum-mechanical theory of the periodic system. This 
optimism dates back to Rutherford’s speech delivered in 
1934 on the centenary anniversary of Mendeleev’s birth. 
The luminary of new atomistics said that quantum mechan- 
ics is so perfect a science that the periodic law could be 
deduced from its basic principles. This could be done by 
any good mathematician conversant with the quantum me- 
chanics of the electron and with Pauli’s exclusion principle 
(even if he had no idea of the periodic law) [15]. 

Viewed against this background, the statement of such an 
authority in quantum mechanics as C. Coulson made in 1969 
(on the 100th year anniversary of the periodic law) appears 
to be more reserved. He said that modern wave mechanics 
had not cleared up the details of the periodic system but 
at the same time it could yield qualitative, and sometimes 
even semiquantitative information about some aspects of 
the periodic system [16]. 

An appreciation of the real contribution of quantum me- 
chanics to the construction of arigorous theory of the period- 
ic system would require a detailed treatment of the prob- 
lem, which would go far beyond the scope of this article 
(see below). This problem is considered at length elsewhere 
[17, 18]. But in any event quantum mechanics turned out to 
be that very ‘internal mechanics of atoms and particles’, 
which, as Mendeleev expected, would penetrate into the 
deep recesses of the phenomenon of the periodicity. 

Using various approximations and fast computers it is 
now in principle possible to compute Lhe electron conligura- 
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tions of atoms with large Z, also those of yet unknown ele- 
ments (see below). Generally speaking, a success of such 
computations would imply a theoretical derivation of the 
periodic system, but (and this is a very important ‘but’) of 
the periodic system of atoms, not elements. M. Haissin- 
sky [19,p.120] maintained: ‘It is not always sufficient to 
have a distribution of peripheral electrons to characterize the 
chemical behavior of elements and to establish their place 
in the periodic system’, and there can hardly be any objec- 
tion to this. It would be pertinent to cite here G. Seaborg’s 
words. Commenting on the chemistry of superactinides and 
predicting that the chemistry may turn out to be exceedingly 
complicated, he said that one could assume that the location 
of such elements in the periodic system would be determined 
by their atomic numbers rather than by their chemical 
properties [20]. 

These speculations suggest the conclusion about the true 
role of the concepts of new atomistics played in interpreting 
the periodicity. Namely, an electron-level account of this 
phenomenon appears to be necessary, although insufficient, 
condition for a fundamental qualitative interpretation of 
the periodic system of the elements, because the latter is 
a much more profound generalization than the periodic 
system of atoms. 

Also there are some unsolved questions concerned with 
the completeness of the theoretical justification of the 
real scheme of fashioning the electron configurations of 
atoms (for the moment we ignore the structure of hypothetic- 
al atoms with very large Z). On the whole, nobody now calls 
in question the general order of filling the atomic orbitals 
(the term ‘atomic orbital’ (AO) is part and parcel of the 
quantum-mechanical parlance). But why does nature favor 
the real scheme of filling AO’s, not some other? 

The ‘ideal’ sequence of filling AO’s is disrupted after 
the electron configuration of the argon atom with Z = 18 
has been reached (15?2s?2p#3s?3p®). In the potassium (Z = 19) 
and calcium (Z = 20) atoms 4s-electrons appear, although 
ten more places are still vacant in the 3d-AO of M-shell. 
A trivial explanation of the ‘premature’ appearance of 4s- 
electrons in potassium is based on the assumption that the 
energy of 4s-AO (e,, = —0.14748 a.u.) is smaller than the 
energy of 3d-AO (ea = —0.05815 a.u.). Other cases of 
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deviations from the ‘ideal’ order in other regions of the 
periodic system are interpreted in an analogous manner. 
The real course of filling AO’s is viewed as being entirely 
due to the course of the variation of their energies. For 
a number of reasons this interpretation can hardly be called 
convincing. It is unclear, for example, why in the scandium 
atom (Z = 21) 3d!4s? is realized instead of 3d%, although at 
this Z the energy €q is already lower than &,,. 

Therefore, another approach to accounting for the filling 
sequence with increasing Z deserves attention. According 
to I. Dmitriev: ‘Actually the assigning (in theory) to an 
atom of one configuration or another is based not on the 
relative values of orbital energies and on the filling with 
electrons of the lowest single-electron levels, but on the 
requirement of a minimum of the total energy of the atom 
(Exot), Which does not reduce just to the sum of single- 
electron ones... 

‘In the K atom (in its ground state) occurs the configura- 
tion [Ar] 4s!, not [Ar] 3d!, not because e,,< &34, but because 
in the first case the total Hartree-Fock energy of the atom 
is smaller than in the second: 


[Ar ]4s! Etot = — 599.16479 a.u. 
[Ar ]3d! Erot = — 599.07571 a.u. 


Similarly, with Sc in the ground state 3d-AO is filled not 
because &,, > Ega > E4s (in reality esq < &,, in the ground 
configuration), but because £,,,([Ar] 3d!4s?) ='—759.2603 
a.u. < Exı(lArl3d?4s!) = —759.21345 a.u.” [18]. 

This approach seems to make sense. But a question pres- 
ents itself, a question of immense importance for an under- 
standing of profound reasons behind the periodicity at the 
electron level: why do minima of the total energy correspond 
to atoms with distinct values of Z, such that prevail in the 
construction of the real scheme of formation of atomic elec- 
tron configurations? There is no answer to the question yet, 
and this seems to be an unsolved problem of atomistics. 

In the last 10-15 years there have been many studies of 
the mathematical modelling of the periodicity based on the 
concepts of set, number and group theories (for the biblio- 
graphy see [18]). The most interesting results have been 
obtained in terms of group-theoretical models. Examination 
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of these results suggests, in particular, that the classifica- 
tion of atoms of chemical elements based on symmetry 
considerations is irreducible to purely spatial considera- 
tions. The periodicity reflects the dynamic and commuta- 
tion symmetries of atoms that manifest themselves in 
single-electron properties [21]. But the group-theoretical 
model does not give a definite answer to the above question. 

Early in this section we noticed one unique feature of 
the atom as a complex material structure. There is another 
one, no less unique: since atomic electrons are thought of 
as moving in the spherically symmetric nuclear field, the 
atom is characterized by spherical spatial symmetry. This 
‘geometric image’ of the atom constitutes one of the most 
important triumphs of modern atomistics. 


4.2 Atomic Species and the Nuclear Level of 
Understanding of Periodicity 


The number of nuclides known as of now is about 2,000. 
They can be clearly divided into three groups: (1) stable 
nuclides—isotopes of naturally occurring stable elements; 
(2) natural radioactive nuclides—members of radioactive 
families of thorium-232, uranium-238, and uranium-235; 
(3) artificial radioactive nuclides, synthesized using various 
nuclear reactions. 

We can now safely state that all stable nuclides have 
already been discovered and they are about 280 in number. 
The word ‘about’ here is used because some of them in 
reality exhibit a weak radioactivity with extremely large 
half-lives. Examples are potassium-40, vanadium-50, rubi- 
dium-87, indium-115, antimony-123, lanthanum-138, ceri- 
um-142, neodymium-144, samarium-147, lutecium-176, tan- 
talum-180, rhenium-187, platinum-190, bismuth-209. But 
their lifetimes are so long that they might as well be re- 
garded as stable. 

When in the 1920s, largely due to Aston, stable isotopes 
were discovered en masse, hardly anybody doubted that all 
stable elements have atomic species. But the real picture 
appeared to be different: about 25% of stable elements are 
monoisotopic, i.e. they are represented in nature by a sin- 
gle species. Such are Be, F, Na, Al, P, Sc, Mn, Co, As, Y, 
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Nb, Rh, I, Cs, Pr, Tb, Ho, Tu, Au, Bi. Except for berilli- 
um-9 all the nuclides just listed have odd values of Z. 
Therefore, (for stable elements) the thesis of classical at- 
omistics that the number of atomic species that occur in 
nature equals the number of elements can be considered refut- 
ed only partially. Other elements with odd atomic numbers 
have no more than two stable isotopes. Conversely, some even 
elements are represented by a large number of stable atomic 
species (potassium has 6 isotopes, krypton—6, cadmium—8, 
tin—10, tellurium—8, xenon—9, barium—7, and so on). 
Modern atomistics has not yet explained this ‘scatter’ of the 
numbers of existing stable species for odd and even Z. It is 
only interpreted within individual semiempirical regulari- 
ties associated with some specific features of nuclear struc- 
tures. Say, the abundance of stable isotopes in calcium and 
tin is accounted for by the fact that the nuclei of these atoms 
have magic values of Z. 

Radioactive nuclides that enter into radioactive families 
are also actually exhausted (there are 46 of them). It can 
only be admitted that some new radioactive branchings 
may be found (at one time this resulted in the discovery of 
individual isotopes of francium and astatine). The families 
of thorium-232, uranium-235 and uranium-238 have differ- 
ent numbers of members (13, 15 and 18, respectively). 
Each family has its structure because of the different length 
of ‘links’ of successive a- and ß-decays and the presence of 
radioactive branchings in individual representatives of 
the families. These characteristic features in turn are in need 
of a more rigorous treatment than the one available. 

These two groups include virtually all nuclides that occur 
in nature. The list can only be augmented by several radio- 
active atomic species that continually form as a result of 
natural nuclear reactions (e.g. tritium and carbon -l4). 

One important aspect of the multitude of natural nuclides 
is their different origins: they are clearly divided into pri- 
mary and secondary. 

The primaries are all the stable nuclides within the in- 
terval Z = 1-83, and also the radioactive thorium-232, 
uranium-235, uranium-238, whose half-lives are compatible 
with the age of the Earth. All these nuclides formed in the 
process of nucleosynthesis in stars before the Solar system 
formed. Nuclides with shorter half-lives had decayed earlier 
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in its life. Therefore, all the nuclides that enter into 
natural radioactive families (save for their ‘ancestors’ and 
the concluding stable isotopes of lead) are secondary (iso- 
topes of polonium, radon, radium, actinium, protoactinium, 
astatine, and francium) and they occur in the Earth’s crust 
in trace concentrations. With a certain reservation classed 
with secondary natural nuclides can be the products of 
natural nuclear reactions, such as some isotopes of techne- 
tium (Z = 43) and promethium (Z = 61), and also neptuni- 
um-237 and plutonium-239, which are available in tiny 
amounts. Concerning technetium and promethium, one 
should notice that this is a special case, not without interest 
for present-day atomistic conceptions. The series of stable 
atomic species for Z = 1-83 happens to be not continuous: 
at two points it has gaps, which was long a bewilderment, 
since there was no way of filling in the gaps between molyb- 
denum and ruthenium and between neodymium and sama- 
rium. The absence of stable isotopes of technetium and 
promethium was only given a rigorous theoretical explana- 
tion in the 1950s. 

Lastly, the largest group of currently known nuclides are 
artificial radioactive nuclides—about 1,600, and this number 
naturally has a tendency to grow. Radioactive isotopes have 
been synthesized for all naturally occurring elements. In 
addition, more than 150 nuclides with Z >92 were obtained 
artificially, they come under the heading of transuranium 
elements (with Z = 93-109), the overwhelming majority of 
artificial radioactive nuclides being short-lived. As we have 
noted above, the discovery and investigation of radioactiv- 
ity brought into atomistics the utterly important factor 
of time. But no quantitative laws have been found so far 
that govern the variation of half-lives of radioactive isotopes 
for some or other of the values of Z. This problem appcars 
to be of global significance, but there are no clues to its 
solution. Likewise, there is no more or less definite estimate 
of the limit of the existence of atomic structure of matter 
(see below). 

The discovery of isotopy was largely responsible for the 
shaping of modern atomistic concepts. Of specinl importance 
was the initial discovery of isotopes in stable elements, 
because this made it possible to solve one of the most impor- 
tant problems of atomistics—to understand the nature of 
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atomic weights of elements and to account for their being 
not whole numbers. 

In 1920 J. Chadwick performed experiments to find direct- 
ly the numerical values of nuclear charges for several ele- 
ments and his results coincided with the predictions from 
the Moseley law. These experiments concluded the rigorous 
proof of the monotonic growth of Z with an increment one 
in passing from one element in the periodic system to the 
next. It became thus absolutely clear that the sequences 
of elements in increasing order of their atomic weights, on 
the one hand, and of their nuclear charges, on the other 
hand, are essentially symbatic. There are some well-known 
anomalies (Ar-K, Co-Ni, Te-I) in which the atomic weight 
of the first element in each pair is larger than of the next. 

The reasons behind these anomalies were understood after 
isotopy had been found in stable elements and the abun- 
dance of stable isotopes for agivenZ had been determined. It 
was generally revealed that in ‘anomalous pairs’ the heaviest 
isotopes of elements with lower Z are the most abundant. 
In fact, the abundance of *Ar is 99.6%, whereas in the 
group of natural isotopes of potassium the maximum lies at 
39K (93.08%). In the Co-Ni pair there is only one stable 
isotope of cobalt—**Co (100%)—and the heaviest of the 
nickel isotopes—**Ni—has an abundance of 67.76%. With 
Te-I (31.79% for !Te and 34.49% for !?’Te against 100% 
of 127I) the situation is similar. It is much more difficult to 
account for the origins of these anomalies. A satisfactory 
explanation can only be given for the Ar-K pair, assuming 
that the entire “Ar contained now in the terrestrial atmo- 
sphere is radiogenic, i.e. it is a product of the radioactive 
decay of *°K, which by the way accounts for the extremely 
low abundance of the latter. It is quite possible that earlier 
in the evolution of the Earth this anomaly was not there. 
As for the prevalence of heavier isotopes of Co and Te, there 
are several explanations, none of them watertight, however. 

The anomalies had no effect on the discovery of the peri- 
odic law, nor on the development of the periodic system of 
elements, although they presented some difficulties. They 
appeared to be too few to disturb somehow the general pat- 
tern of the variation of the behavior of elements, i.e. the 
periodicity principle established by Mendeleev. But what 
about the very fact that anomalies are so few—perhaps it 
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represents some, yet unknown, feature of the material 
world? The question presents itself: Were there initially 
more anomalies? Suppose that this was really the case. The 
hypothetical series of the chemical elements lined up in 
increasing order of their atomic weights could then differ 
markedly from the habitual natural series of elements. And 
so markedly that this would obliterate any regularities of 
the variation of their chemical properties (as it was the case 
in reality). In other words, neither in 1869 nor much later 
a periodic law would have been discovered (let alone a possi- 
bility of any rational systematization of elements). Atomis- 
tics would have taken a different path. Admittedly, sooner 
or later scientific thought would have produced an atomic 
model and the elements would have been arranged in a natu- 
ral series in increasing order of Z, but this sequence, being 
a monotonic one, would have been in sharp contrast with 
the sequence of variation of atomic weights of elements. 
This is just a speculation, however. Stable nuclides are 
distributed over their mass numbers and abundance (for 
certain Z) in such a way that a possibility of the existence 
of anomalies other than those mentioned above is excluded. 
We just have not found yet a clear answer to the question of 
why precisely these nuclides, not others, appeared to be 
stable. The material out of which the Solar system was 
formed initially contained an enormous amount of stable and 
radioactive nuclides. In the end all the radioactive nuclides 
have decayed away and turned into stable nuclides (or rather 
all the short-lived radioactive nuclides), which of course 
could not but exert an influence on the abundance of stable 
nuclides. But the picture, it seems, was dominated by a law 
that in modern theoretical nuclear physics is known as the 
Shchukarev-Mattauch rule [22]. It states that if two isobars 
differ in Z by 1, then one of them must be unstable. It was 
this rule that predetermined those intervals of the mass 
numbers of stable isotopes which, in the final analysis, gov- 
ern the natural sequence ofthe atomic weights of elements. 
Abundances of individual stable isotopes of elements are 
more difficult to interpret. But one point is not without in- 
terest: if for each element we align the nuclides with the 
greatest abundance, the series would show the same anomalies 
as in the natural series of elements lined up in the increas- 
ing order of Z: Ar-K, Co-Ni, Te-I. It seems to have happened 
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so that the realization of the natural series of elements with 
ever increasing atomic weights was prepared first by the 
strong interaction (the reactions of nuclear fusion in stars, 
a-decay, spontaneous fission) and the weak interaction 
(B-decay). It is here that the nuclear basis of the periodicity 
stands out. 

After many isotopes of stable elements had been discov- 
ered, it became tempting to pose the question of whether or 
not Mendeleev’s periodicity principle covers the multitude 
of isotopes. This is exactly the way S. Shchukarev formu- 
lated the question. He stated that ‘the periodicity is 
a property inherent in the atom’s nucleus itself’ [23]. But in 
the 1920s it was fairly difficult to justify this point. An 
explanation of the causes of the elemental periodicity 
has to await the coming of the atomic model. Likewise, to 
bring out the nuclear periodicity required some justified 
concepts of the nuclear structure. But even when the proton- 
electron model of the nucleus prevailed, assumptions were 
made that the nucleus, like the atom, had a shell structure, 
although this idea was not developed further. The ideas of 
the nuclear causes of the periodicity were really striking 
roots during the 1930-40s. This was largely due to the 
emergence of the proton-neutron model of the nucleus, the 
application of quantum-mechanical ideas to the description 
of nucleon states and the construction of the so-called shell 
model of the nucleus. The core element of this model was the 
concept of the magic numbers of nuclear protons and neut- 
rons: Z, N = 2, 8, 20, 50, 82, N = 126. It was postulated 
that in nuclei with these values of Z and N definite nucleon 
shells are filled up. Established at first empirically, these 
magic numbers were later supported theoretically. Signifi- 
cant contributions to the theory of nuclear periodicity were 
made by M. Göppert-Mayer and J. Jensen [24]. 

So far many systematizations of nuclides have been sug- 
gested (some of them are even termed ‘periodic systems of 
isotopes’); all of them in some way or other rely on the 
shell model of the nucleus. One version was due to I. Seli- 
nov [25]. In terms of the information content such systems 
of classification are still inferior to the periodic system of 
elements. Also the principles behind them cannot yet be 
considered well established. The forms of reflection of the 
electron and nuclear periodicity can hardly be given in 
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a strictly adequate manner, based on the facts of the succes- 
sive construction of electron and nucleon shells and sub- 
shells. What is more, we do not yet know all the nuclides 
that are somehow able to exist. In general, this difficulty 
is of the same sort as the ignorance of the total number of 
elements for the periodic system in the days of classical 
atomistics. 

For several decades now discussion flares up from time to 
time of whether or not there exists (or can exist) a correla- 
tion between the electron and nuclear periodicities. Put 
another way, the question will be: Does the mass of an atom 
have an influence on the character of filling the electron 
configurations? We should not perhaps be too quick with 
a negative answer, although there are also no grounds as yet 
for a positive one. Meanwhile there are hints of some 
‘correlation’ line. The above-mentioned Göppert-Mayer and 
Jensen introduced the concept of the so-called spin-orbit 
interaction, which is responsible for the splitting of indi- 
vidual nuclear subshells into two sublevels which are filled 
with nucleons not in succession but with gaps to construct 
other subshells. It was this concept that enabled the magic 
numbers to be found theoretically. On the other hand, 
computer-assisted studies of the ground electron configura- 
tions of the atoms of hypothetical elements with Z > 
120 show that individual electron subshells must also 
suffer a similar splitting (more details are given in the 
reviews [26, 27]). This suggests that in the region of very 
large Z the electron periodicity shows some effects inherent 
in the nuclear periodicity. Effects of this sort, called rela- 
tivistic ones, are now widely employed to interpret some 
properties of elements with large Z. 


4.3 The Variability of Atoms 


According to the ideas of modern atomistics atoms can 
change in two ways: by radioactive decays and a variety of 
nuclear reactions. 

The kinds of radioactive decays are relatively few in 
number, and there is no telling now whether more of them 
will be discovered. The commonest kinds are a- and B-- 
decays. They cover more than 80% of the known radioactive 
nuclides. Essentially all natural nuclides undergo one of 
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these decays. The lightest B--active nucleus is *He; further 
a weak ß--activity has been found in *°K, another of the 
light elements. With natural nuclides a-decay appears at 
first for rare earth elements. All natural isotopes of elements 
with Z >80 that are members of radioactive families 
undergo a- or B--decays. The above explains why the emis- 
sion of a-particles and electrons has been discovered first. 
Only in 1940 another form of variability was found, which 
is observed in natural radioactive nuclides: the rare process 
of spontaneous fission, inherent only in uranium and thori- 
um. As Z increases this kind of radioactive decay becomes 
ever more probable for individual isotopes of transuranium 
elements. 

Other now known kinds of radioactive decay were only re- 
vealed after artificial radioactivity had been discovered; they 
are essentially inherent in artificial radioactive nuclides. 
These primarily include the positron, or ß*-decay (I. and 
F. Joliot-Curie, 1934), X-capture (the capture by the nucleus 
of an electron from the K-shell), which was first predicted 
theoretically and then found experimentally in 1938 by 
L. Alvarez. Now emission of an electron and positron, and 
K-capture (and rarely L-capture) are commonly viewed as 
forms of the general process of ß-decay. 

!Also predicted were some other kinds of radioactive de- 
cays. So in 1935 Géppert-Mayer assumed that the so-called 
double ß-decay, or 2ß-decay is possible (simultaneous emis- 
sion of two electrons or two positrons), but it has not been 
observed yet experimentally. Nevertheless, the supposition 
of 2ß-stable nuclei is used by theoreticians, in particular 
in the context of nuclide systematization. On the contrary, 
the predictions of the proton p-decay (B. Dzhelepov, 1951) 
and double-proton 2p-decay (V. Goldanskii, 1960) have 
been supported experimentally, although they are exception- 
ally rare processes, characteristic of individual nuclides 
with a very large neutron deficit. 

Exceedingly rare processes also include the emissions of 
delayed neutrons, protons, °H and *He nuclei, and the so- 
called spontaneous fission of nuclei from an isomeric state. 
These processes have been discovered by Soviet scientists. 

In the late 1960s the Soviet scientist Yu. Shukolyukov 
discovered anomalies in the isotopic composition of neon 
and argon in some uraniferous minerals. He concluded that 
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these anomalies cannot be explained by supposed natural 
nuclear reactions and made the assumption: ‘Along with 
two familiar competing processes of decay of heavy nuclei— 
a-decay and spontaneous fission into two heavy fragments— 
it can be assumed that there also exist radioactivity of the 
type such that nuclei emit particles intermediate in mass 
between an alpha-particle and a heavy fragment: *4Ne, ??Ne, 
38Ar, Ar, and so on. The rate of this radioactive transforma- 
tion is determined by the penetrability of the potential 
barrier of the disintegrating nucleus and it must be lower 
than the rate of the a-decay and higher than the rate of the 
spontaneous fission’ [28]. A theoretical analysis of the new 
kind of radioactive decay was given in 1977-80 by the 
Roumanian physicists A. Sändulesku, D. Poenaru, M. Iva- 
sku and the German physicist W. Greiner [29]. These re- 
searchers reasoned along two possible lines: (1) an extremely 
asymmetrical fission, or (2) emission of so-called nucleon 
clusters. It was also shown that the probability of the new 
kind of radioactivity must be sensitive to shell effects. 
Therefore, a resultant product may be the doubly magic 
nucleus *8Pb or its neighbors, and the ‘clusters’ may be 
neon-24, magnesium-28, silicon-32, calcium-48, and others. 
Sändulesku suggested that this kind of decay be called 
‘magic radioactivity’. 

Experimental attempts to detect the new kind of radioac- 
tivity have been made since 1980. They were crowned with 
unquestionable success late in 1983. And in January 1984 in 
Nature H. Rose and G. Jones of Oxford published a paper 
‘A New Kind of Natural Radioactivity’ [30]. They reported 
observations of the emission of carbon-14 by radium-223, 
with the formation of lead-209. They estimated that each 
40° acts of a-decay of radium-223 yielded only one emission 
of a carbon nucleus. Rose and Jones proposed the following 
probable processes: 

13¢ 15 

223Ra ——> 210Ph, 227Ac —— 210Ph, 
18 196 

227Th — — 209Ph, 27Th —— 208Ph, 


Supporting evidence came in 1984 from France and the 
USSR. Up until now the new kind of radioactive transfor- 
mation has been well established for eight nuclei: for radium 
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isotopes (?*Ra, ?Ra, 222Ra, ?*Ra) the emission of carbon 
nuclei is observed, whereas other isotopes (of heavier ele- 
ments, such as protoactinium and uranium) emit neon nuclei. 
Research work in this field is expanding, and so fresh inte- 
resting results are forthcoming, beyond doubt. 

All the known kinds of radioactive decays can be divided 
into two groups: single-step (elementary) and two-step 
(complex) decay. The first group includes a-decay, ß-decay, 
spontaneous fission, proton and double-proton activities. 
These processes are all concerned with penetrating a poten- 
tial barrier (except for B-decay, which is due to weak inter- 
actions). The second group includes processes in which 
ß-decay is a preliminary stage and then an excited resultant 
nucleus emits delayed particles, for example protons and 
neutrons. When comparing the ways in which the atoms, 
members of these group, change, one should again pay 
attention to a certain ‘regulating’ role of the factor of time 
in the concepts of modern atomistics. There is an important 
difference between ‘true’ radioactivity and the emission of 
delayed components, although there is a measure of com- 
monness, namely an arbitrary temporal boundary of the 
existence of nuclei that undergo simple or complex changes, 
which is about 101? s. If we ignore this circumstance, then all 
the decays of compound nuclei (formed in the first stage of 
artificial nuclear reactions) should be considered radioactive 
decays, which would obviously be wrong. 

Atoms are changed (transformed) artificially in various 
forms of nuclear reactions. But not all nuclear reactions 
necessarily have as a product a radioactive nuclide. In many 
cases the product is a stable nuclide—an isotope of an 
element with another value of Z or also a stable isotope of 
the target element, for example, for a (n, y) reaction. The 
latter does not change Z, and hence the chemical individu- 
ality of the atom is not changed. 

By artificial nuclear reactions they normally understand 
processes in which atomic nuclei change through interaction 
with elementary particles (n, p) or with one another (when 
the bombarding particles are alpha-particles, deuterons, 
tritons, ®?He nuclei or multiply charged ions with Z > 2). 
Nuclear reactions are recorded in a way similar to chemistry 
for bimolecular reactions: A + a- B + b, where A is the 
nucleus of the target element, a is the bombarding particle, 
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B is the resulting product, and b is the emitted particle. 
Commonly, the shorthand notation A(a, b)B is used; it 
was suggested in 1935 by H. Bethe. 

Against the background of an infinite variety of chemical 
reactions (which involve the electron configuration of atoms) 
nuclear reactions are relatively few in number. This is due, 
first, to the limited choice of bombarding particles with 
Z< 2, and, second, to the incomparably more rigid energet- 
ic conditions. Of especial interest are nuclear reactions on 
heavy multiply charged ions (Z > 2), which primarily fol- 
low the scheme A(HI, zn)B, where HI stands for ‘heavy ion’. 

The following is the list of the nuclear reactions involving 
neutrons and charged particles with Z< 2 realized by now: 

(4) neutron reactions: (n, a,) (n, p), (n, ap), (n, np), 

(n, y), (2n, Y); 
(2) proton reactions: (p, zp), (p, zn), (p, zpyn), (p, d2n), 
(P, X), (P, t), (p, zæ); 

(3) deuteron reactions: (d, zn), (d, p), (d, xpyn), (d, ap); 

(4) triton reactions: (t, p), (t, n); 

(5) alpha-particle reactions: (a, zp), (a, zn), (a, xpyn); 

(6) ?He nucleus reactions: (He, zn), (He, e); 

(7) photonuclear reactions: (y, zn), (y, xp). 

It is seen that sometimes the number of emitted particles 
can be more than one (and sometimes different particles are 
emitted, e.g. n and p). 

These reactions were used to obtain the overwhelming 
majority of artificial radioactive nuclides with Z< 101. 
Nuclides with larger Z can only be synthesized by heavy-ion 
nuclear reactions (e.g. !2C, 14N, Ne, “Ar, Cr, and so on), 
since in this case it becomes fairly difficult to select a suita- 
ble target. 

In the general form any nuclear reaction can schematical- 
ly be written as: 


{Z, A} (x, y) {Z+ (Zx— Zy), A+(Ax—Ay)}. 


Here {Z, A} are the charge and the mass number of the tar- 
get nucleus; z is the bombarding particle; y is the emitted 
particle(s); Z, and Z, are the charges of z and y, respec- 
tively; A, and A, are the mass numbers of x and y, respec- 
tively. 

In a decay atoms change not much in terms of Z. The 
nuclear charge can decrease by one (ßt-decay, orbital cap- 
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ture, proton decay) or by two (a-decay, 2p-decay, 2f+- 
decay) and increase by one (ß”-decay) or by two (2B--decay). 
An exception is spontaneous fission in which the values of Z 
of fission fragments differ markedly from those of the orig- 
inal nucleus. For nuclear reactions, however, the degree of 
change varies within a wider range. For light-particle reac- 
tions Z generally changes by 1-3 units; there are far fewer 
reactions in which Z increases by one or two, or remains 
unchanged. For the disintegration reactions (e.g. U (p, 
6p21n)*;2Fr) or heavy-ion reactions the degree of change is 
more appreciable. 

Therefore, an important feature of modern atomistics is 
that the transformation of atoms is now a reality, and can 
be done over a wider range and in more ways than are seen 
in nature. 


4.4 Radioactivity: Process and Property 


The current definitions of ‘radioactivity’ differ only in 
details. Normally by radioactivity is understood spontane- 
ous (i.e. independent of any external factors) transformability 
of an unstable isotope of a given element from the ground, 
or metastable, state into an isotope of another chemical 
element, the process being accompanied by emission of ele- 
mentary particles or nuclei. This definition contains inac- 
curacies. So, the emission of a neutron does not lead to 
the formation of an isotope of another element (with another 
Z), but to the formation of another isotope of the same 
element. Spontaneous fission gives rise to isotopes of two 
(very rarely three) elements that differ drastically in Z. 

With these corrections in mind radioactivity can be de- 
scribed as spontaneous transformability of an unstable 
nuclide with a certain combination of Z and N into (an)other 
nuclide(s) with other combinations of Z and N accompanied 
by emission of material particles of a definite sort (elemen- 
tary or complex). This definition does not carry some fun- 
damentally new information, since, just like others, it has 
one orientation: the phenomenon of radioactivity is treated 
as the process of changing the material structures, which can 
be described by an equation of natural nuclear reaction of 
transformability. 
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Such definitions of radioactivity are, as it were, dynamic. 
But a static approach is also possible here: the characteristic 
of radioactivity as a property of nuclides, whose composition 
is given by a certain combination of Z and N, to undergo 
changes of their structures. In that case, the above-men- 
tioned property of a nuclide could be treated as a primary one, 
whereas the realization of this property would appear to be 
secondary. In other words, the transformability of nuclides 
is a consequence of the objectively existing property of 
instability of nuclides with certain combinations of Z and N. 

In consequence, among the set of nuclides some subsets 
can be distinguished that contain either stable or unstable 
nuclides. Stability implies that a nuclide with a given com- 
bination of Z and N remains unchanged indefinitely; its 
structure can change under external impacts, through arti- 
ficial nuclear reactions of some type or other. On the other 
hand, instability implies that a nuclide can spontaneously 
transform into (an)other nuclide(s). However trivial, the 
term ‘instability’ is more suitable to characterize the phenom- 
enon that for nine decades now has been known as radioac- 
tivity. The term ‘radioactivity’ itself now is rather a tribute 
to tradition. It is formed of the Latin words radio for ‘I emit 
rays’ and activis for ‘active’, i.e. it characterizes the emission 
of rays. We now know that only the so-called y-activity 
(which is not accompanied with ‘nuclide transformations) 
is purely radiant in nature. It would not pay, however, to 
discard a historically established and common term. 

Several realizations of instability (kinds of radioactive 
transformability) have been found experimentally. This sug- 
gests that this general property has several manifestations 
that differ in realization mechanisms. Put another way, 
there exists a certain set of ‘instability properties’, each of 
which has its specific features. Further, note that underlying 
individual ‘instability properties’ are fundamentally differ- 
ent kinds of interactions. So the alpha-particle and proton 
emission and spontaneous fission are due to strong interac- 
tion, whereas ß-transformabilities are due to weak interac- 
tion. 

These two ways of looking at radioactivity, as a property 
and as a process, are of great methodological importance, and 
should be taken into account when a more profound evalua- 
tion of it is desired. 
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4.5 On the Limit of the Existence of Atomic 
Structures of Matter 


As we have seen, nature imposed a fairly rigid limitation on 
the existence of natural species of atoms. The ‘material 
resources’ of nuclides, which can be regarded as stable, 
are represented by relatively few combinations of Z and A 
(totalling 280); for 20 chemical elements the only ‘stable 
combination’ of the numbers of protons and neutrons in 
nuclei is realized. In a yet lesser degree are represented 
natural radioactive nuclides that enter into radioactive 
families. The nuclides thorium-232, uranium-235, uranium- 
238 are ‘generators’ of the natural isotopes of secondary 
elements. Since thorium and uranium are radioactive them- 
selves, their resources in nature decline, slowly but steadily. 
It has been estimated that during the time the Earth has 
existed the concentration of uranium-235 dropped 33 times. 
Uranium-238 declined by more than 40%, and thorium-232 
by about 16%. It is straightforward to work out when on 
our planet atoms of ancestors of radioactive families will 
vanish. The time is long even on astronomical scale, but 
the tendency is there. If we assume that the Earth will last 
that long, then after an interminable period of time it will 
only consist of stable atomic species. But it may well be 
asked: How absolute is this stability? In some of nuclides 
that previously were regarded as stable a weak radioactivity 
has been found. Or perhaps stability is a relative thing in 
general, which depends on the time scale used? 

And so, speaking about the limit of the existence of atom- 
ic structures of matter, we must formulate the problem 
more clearly: What are the limits of artificial production 
of new atomic species? Some believe (see, e.g. [31]) that an 
absolute, in a way, boundary of stability (relative, of course) 
will be achieved if the lifetime of a nucleus is of the same 
order as the duration of nuclear reactions (i.e. about 102° s), 
although this estimate is not undisputable. For the moment 
we will stop discussing this problem and make a small 
historical digression, which is of direct concern for the 
evolution of modern atomistic concepts. 

When in 1870 it was established that uranium had the 
Jargest atomic weight, the reason why the periodic table 
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terminated at uranium could not but attract attention. The 
scientific community awaited discoveries of heavier ele- 
ments, but none were forthcoming. Radioactivity research 
provided only a qualitative explanation: the transuranium 
elements are not to be found in nature because they are 
strongly radioactive. But why then even the nearest trans- 
uranium neighbors would have a ‘survivability’ worse 
than uranium and thorium? Only the arrival of the atomic 
model and its quantum interpretation made it possible 
to have some ingenious insights. 

In 1930 E. Rabinovitsch and E. Thilo in their classical mo- 
nograph The Periodic System of the Elements. History and 
Theory pointed out that the reason why the system is exha- 
usted should be sought for in two directions. ‘Either this 
reason lies in the instability of atomic nuclei, or it is asso- 
ciated with the fact that the instability of heavy nucleican 
depend in some previously unknown manner on the structure 
of their electron shells ... . It may be that between both 
electrons revolving in orbits close to the nucleus and nuclei 
themselves also emerge some interactions, which eventually 
lead to the instability of nuclei’ [32]. Although the authors 
thought the first reason more likely, there were at the time 
no theoretical interpretations—the hour of nuclear physics 
did not struck yet. 

By contrast, the second reason received more attention 
from theorists. The ‘magic number 137’ already figured in 
their papers, it was treated as the limiting number for the 
existence of atoms (it was assumed that the nucleus has 
a ‘point’ character). In the hypothetical atom with Z = 137 
an electron from K-shell had to be captured immediately, 
thereby making further growth of Z impossible. To be sure, 
these calculations did not take into account the factor of 
nuclear instability as such. This line of estimating Zer was 
also pursued in later studies on the basis of already more 
refined theoretical concepts (specifically, the nucleus was 
assumed to have a finite size). Now the resulting values of Z 
were higher (more details about these works are given 
in [17, 21]). However abstract, the line, as we shall see 
later, should not be discarded. 

There are other considerations, however. Computer calcu- 
lations of Zer suggest that the approach of As-electrons of 
K-shell leaves the state of the atom’s outer electrons un- 
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changed. This sort of removes the problem of the upper boun- 
dary of the existence of atoms as carriers of the chemical 
properties of elements [21], or the problem becomes incorrect 
in terms of modern quantum electrodynamics. And how 
about nuclear instability? 

Some clues came from the successes of the synthesis of 
transuranium nuclides and detailed studies of their proper- 
ties. It thus became obvious that the upper boundary of the 
periodic system (uranium with Z=92) is natural, since 
none of the isotopes of the transuranium elements synthesized 
has a half-life compatible with the Earth’s age. However, 
taking into account the fact that secondary neptunium and 
plutonium form constantly as free neutrons bombard urani- 
um, we can in principle move the natural upper boundary 
of the system to the right by two atomic numbers. But the 
need to estimate the upper artificial boundary (in Z) of the 
existence of atomic structures of matter, or rather the 
limit of their synthesis, became as obvious. 

Early in the 1960s it seemed that it was only a question 
of time when this limit would be achieved. The heavy 
transuranium elements became ever more unstable in relation 
to spontaneous fission, their half-lives being now measured 
in seconds and fractions of a second. Extrapolations of the 
data led one to conclude that Z,, beyond which there could 
be no synthesis lies within the range 108-110. But, as so 
often in science, a brilliant insight gave things an unexpect- 
ed turn: it was the hypothesis of the so-called stability is- 
lands (SI) for nuclei in the regions of some of the magic values 
of Z and N that lie well beyond the frontiers of the science. 
Islands were assigned to Z=110, 114, 126 and even 164 
and N= 184 and 196. It was assumed that the correspond- 
ing nuclei must exhibit heightened stability to spontaneous 
fission. This gave grounds for optimism that some of these 
‘superelements’ could be synthesized or found in nature. 
But the intervening two decades saw no hard evidence: 
neither ‘fantastic’ nuclear reactions, (e.g. U + U), nor the 
many tonnes of processed natural materials could provide 
any experimental support to the SI hypothesis. And instead 
of the early enthusiasm came a fairly cautious attitude. 
The ‘final judge here will be the future. 

But the coming of the SI hypothesis at the same time 
introduced a measure of uncertainty into the estimates of 
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Zer and the modern understanding of the periodieity. 
Should the hypothesis be proved, the value of Zer will, 
generally speaking, be dependent on one of the two factors. 
It might be determined by a certain boundary of nuclear 
instability. But where does this magic boundary lie (beyond 
Z=114, 126 or 164)? On the other hand, for example, 
although a nucleus with Z=164 is allowed and it would 
be relatively long-lived, the corresponding atom would be 
unable to exist, forbidden for another reason (the interaction 
of electron configuration with nucleus); this reason, as 
mentioned above, has been held in question. Therefore, the 
artificial upper boundary (in Z) for atomic structures of 
matter remains an enigma of modern atomistics. 

A simile would be in order here. For the set of unstable 
atoms in the interval Z= 84-107 (i.e. naturally occurring 
and synthesized) a remarkable feature stands out. In this 
interval first come the elements all of whose isotopes are 
short-lived and secondary in origin. The ‘mainland of 
stability’ terminates at bismuth (Z=83) and a ‘sea of 
instability’ starts. But in the latter soon emerges a ‘stability 
island’ around natural long-lived primary isotopes of 
thorium and uranium. Then comes a stretch of ‘shallow 
waters’: individual isotopes of synthesized neptunium, 
plutonium, americium and curium, which have rather long 
lifetimes, at any rate longer than for the isotopes of the 
pre-thorium elements. Then the ‘depth’ starts to grow, and 
the synthesized nuclide with Z=107 has a half-life as 
short as 0.003 s. Perhaps this means that the ‘Mariana 
Trench’ is near? The notion that Zer lies within the inter- 
val 108-110 is a direct indication of this. The SI hypothesis, 
on the other hand, predicts an ‘uplift of seabottom’ at 
certain points: the emergence of new ‘islands’. This reasoning 
suggests that after Z=83 ‘islands’ can be expected. 

The ‘island’ elements with magic Z would be sort of 
ancestors of radioactive chains with a trend for Z to decrease 
gradually (just as it occurs in natural radioactive families), 
if spontaneous fission would not be in these regions of ‘super- 
elements’ a predominant reason of ‘death’ of corresponding 
nuclides. 

Since all attempts at breakthroughs in the region of 
‘islands’ failed, the synthesis of transuranium elements 
(quite slack now) follows the well-trodden path: increasing 
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Z by one in succession. There is no saying now how long will 
it yield results. But there is one consideration to take into 
account: even if a nucleus with a given Z is synthesized suc- 
cessfully, will it imply that the result is an atom of a new 
element? For an atom-like structure to form the synthesized 
nucleus must acquire a certain electron configuration. This 
requires a time of the order of 10 3-10? s. If the nucleus’s 
lifeteme is shorter, it makes no sense speaking about syn- 
thesis of a new atom, about any chemism of the resultant 
nuclide. 

Let us now turn to another sequel of the SI hypothesis. 
Clearly, our understanding of the laws governing a phenome- 
non is the more profound the larger the number of objects 
covered by them. As long as the periodic system terminated 
at uranium, the treatment of the periodicity appeared quite 
rigorous and consistent. But already a detailed study of the 
properties of the so-called actinides (their synthesis greatly 
increased the number of known chemical elements, and 
thereby the scope of the periodicity) showed that in this 
region of Z the periodic variation of properties becomes more 
complicated in nature (for more details see [33-35]). When 
synthesizing transactinides researchers had to deal with 
their isolated atoms, therefore even a rough estimate of the 
chemism of an elements appeared to be a challenging and 
delicate work, and the experimental results obtained cannot 
be viewed as final. It is quite obvious that when the issue 
of ‘island superelements’ came to the fore, the scarcity of 
new atoms was not to be overlooked. Accordingly, it was 
quite desirable to predict the electron configurations of 
these atoms and the most important properties of the cor- 
responding elements. The predictions were made on a comput- 
er (a review of appropriate studies is given in [26, 27]) for 
Z = 104-172. It was found that for the given interval of Z 
one was to expect a further complication of the phenomenon 
of periodicity, which would in essence destroy the current 
conceptions of the laws governing the periodic system. 
Whether these predictions come true or not we cannot judge 
so far. A successful synthesis of at least one ‘superelement’ 
would provide exceedingly important fuel for discussions. 
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5. Concluding Remarks 


It-was shown that the modern set of atomistic concepts and 
ideas are eventually a result of the development of physical 
views in the 20th century, a product of primarily physical 
theories and experiments. This is a fundamental difference 
of the atomistics of these days from the atomistics of the 
last century. At the same time there are many gray areas to 
clarify which perhaps some new fundamental discoveries 
will be required, similar to those in the 1890s. 

In classical atomistics major breakthroughs were the 
discoveries of new atomic species (new chemical elements). 
Some ofthem became known owing to applications of physic- 
al techniques (spectral—optical and in the 20th century 
also X-ray ones); but, as a rule, nearly concurrently these 
elements were chemically identified, i.e. isolated in free 
form or as compounds. But during this period classical 
atomistics dealt in practice with exceedingly large numbers 
of atoms of chemical elements; even the rarest elements were 
available in weight amounts. With radioelements the picture 
became much more complicated. Sooner or later researchers 
succeeded in isolating most of them, although in tiny 
amounts, but in all cases they were dealing with elements 
that really occurred in nature. 

The atomistics of the 20th century brought to life atomic 
species that are generally not to be found in nature. Techne- 
tium, promethium and transuranium elements up to Z = 
100 are now routinely accumulated in weight amounts 
(although for some heavy actinides these amounts are 
measured in micrograms). They can now be subjected to 
extensive chemical studies. Some of them are even produced 
in commercial quantities. Half-lives of nuclides with Z > 
100 are in the region of minutes, seconds or split seconds. 
It is quite obvious that ‘accumulation’ of these nuclides is 
out of the question; only some of them can be synthesized 
in practice. Therefore, even the very phrase ‘the discovery 
of a new element’ assumes here a specific meaning: the results 
obtained can with good reason be only interpreted as evi- 
dence for the possibility of synthesizing atoms with certain 
values of Z through some types of nuclear reactions. Just 
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born, such atoms ‘die’ in a short while. To investigate them 
it is necessary to reproduce the synthesis process. 

Here we find ourselves confronted with a question, a cor- 
rect answer to which seems to be of great importance. Exper- 
iments with isolated atoms of heavy transuranium elements 
yield results providing an insight into the chemical nature 
of the elements involved (this, for example, was the case 
with elements with Z = 104, 105). Suppose these elements 
could be synthesized in far larger amounts (which is at 
present impossible, of course). Would an evaluation of their 
chemism lead to the same results or not? Could we speak 
about the adequacy of the chemical behavior of individual 
atoms and their sufficiently large ensemble? At any events 
there is no convincing evidence in favor of the adequacy, and 
it still remains to be proved experimentally, or disproved. 

Whereas classical atomistics, which studied stable ele- 
ments (both known and discovered), dealt with inseparable 
combinations of their isotopes (save, of course, for the cases 
of monoisotopic elements), the atomistics of the 20th century 
discovered in nature (natural radioactive nuclides) and then 
worked out techniques of synthesis (artificial radioactive 
nuclides) of a large number of atomic species, and so from 
the very beginning it was oriented at the properties of 
individual elements. 

The physical justification of the periodic law became 
a great triumph of modern atomistics. But in terms of real 
information content, its physical formulation, which states 
the periodic variation of properties of elements with increas- 
ing Z, has not changed significantly in comparison with its 
chemical, Mendeleev’s, formulation. Furthermore, it is now 
well known that for large Z relativistic effects cause the 
periodicity to smear out, thus making it quite difficult to 
place rationally the elements in the periodic system. It is 
too early at this stage to raise the question of introducing 
any corrections into the formulation of the law, but under 
the pressure of new experimental facts and theoretical 
predictions certain corrections might be quite likely. 
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State-of-the-Art 
in ‘New Atom’ Research 


V. I. Goldanskii and V. P. Shantarovich 


l. Introduction 


One feature of the vehement growth of physics in the 20th 
century, is, undoubtedly, the correlation of this science with 
other fields of human endeavor, their mutual enrichment. 
The recent three decades have seen the spectacular advances, 
for example, in a new area at the junction of elementary- 
particle physics and chemistry, the so-called chemistry of 
new atoms, in which either an electron is replaced by another 
negative particle or a proton by another positive particle. 
This area of research does not confine itself to chemical 
studies alone. External influences on the properties of new 
atoms, and also those of mesons and positrons, provide use- 
ful, sometimes unique, information about the structure of 
solids. 

Atoms with an electron replaced by a negative meson are 
called mesoatoms. When chemically bound to conventional 
molecules and radicals they form mesomolecules. Typical 
examples are u- and n-mesoatoms. 

Also possible in principle are K-mesoatoms, in which an 
electron is replaced by a negative hyperon, antiproton, or 
antihyperon. 

Atoms with a proton replaced by a positive particle are 
called by the name of that particle with an ending ‘ium’. 
Since a penetration of any positive particle into a nucleus 
is hindered by Coulomb repulsion (which can be compensated 
for by the nuclear forces alone), for light positive particles 
not involved in nuclear interactions, such as positrons e+ 
and pt-mesons, the atoms of this type cannot have nuclei 
consisting of many nucleons. These atoms only form when 
a positive lepton replaces a single nucleon, namely, a pro- 
ton. In that case, an atom of a positron and an electron is 
called positronium (Ps). A bound system of u+-meson (posi- 
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tive muon) and an electron is known as muonium (Mu). 

Both atoms can be treated as light isotopes of hydrogen. 
What is more, positronium is the lightest of all known new 
atoms, theonly atom without a nucleus. In fact, since both 
constituent particles have the same mass, the centre of mass 
does not coincide with any of them (otherwise such a particle 
en be regarded as a nucleus), but lies strictly in between 
them. 

A nuclear proton can also be replaced by a positively 
charged hyperon in much the same way as a neutron by 
a neutral hyperon. The result is the so-called hypernuclei, 
and the corresponding atoms are referred to as hyperatoms. 
The structure of their electron shells and their chemical 
properties are, however, essentially the same as in conven- 
tional atoms with the same nuclear charge. Therefore, hyper- 
atoms are generally not included into the list of new 
atoms. 

Interest in new atoms chiefly comes from the fact that 
their characteristics are appreciably influenced by the 
chemical properties and the structure of their environment. 
We will be looking at them in more detail in the next sec- 
tion. Examination of the characteristics of new atoms can 
supply unique evidence concerning the properties of the 
environment. One important characteristic of new atoms is 
their lifetime (all of them unstable). The decay characteristics 
of new atoms are studied using the apparatus of nuclear 
physics. 

The first stage of new atom research is the study of the 
laws by which the environment affects the characteristics 
and fate of new atoms. The second stage, more important for 
other research fields, consists in obtaining more complete 
and new structural and chemical data or data on the kinetics 
of certain processes based on observations of characteristics 
of formation, transformation and disintegration of various 
new atoms. 

At the present time both the investigations into anni- 
hilation (death) of positrons and positronium and the 
mesochemical studies of u- and m-mesoatoms and muonium 
have attained the second stage and started to yield informa- 
tion of general chemical significance. 
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2. Characteristics of New Atoms 


Table 1 is a summary of some of the properties of elementary 
particles constituting the new atoms. Particle masses are 
given in relative (in relation to the electron mass) units 


Table 1 Some Properties of the Elementary Particles of Which New 
Atoms Are Made Up 

















Name Symbol rel. units MeV/c? Charge Lifetime, s 
Electron e 1 0.511 —1 Stable 
Positron et 4 0.544 +1 Stable 
-Meson pt 206.77 | 105.66 +1 | 2.26x 10-8 
(muon) 

n-Meson nË 273.15 | 139.58 +1 | 2.56% 10-8 
(pion) 

Proton P 1836.1 938.2 +1 Stable 


and in ‘energy’ units that correspond to the Einstein equa- 
tion. Unstable mesons decay in the following way: 


pt > e+ +v (ve) + Yu (vy) (4) 
nt —> Bs Vy Cy) (2) 


where v and v are the corresponding neutrino and anti- 
neutrino. The positron as such is a stable particle and it only 
disappears as a result of its interaction with the environ- 
ment. In the previous section we have already mentioned 
that positronium and muonium are hydrogen-like. Replace- 
ment of the electron in the hydrogen atom by n-or p`- 
meson yields mesohydrogen. 

Characteristics of hydrogen-like new atoms are given in 
Table 2. The reduced mass—a parameter that is fairly im- 
portant for determining other characteristics of new atoms— 
is computed from the given masses of the constituent parti- 
cless m, and m: 

u = mm (m + m,). 


In terms of reduced masses positronium has only half the 
mass of the atomic hydrogen, while in terms of absolute 
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Table 2 Properties of Hydrogen-Like Atoms 

















Name Symbol/Composition u r, cm Vion’ eV 
Hydrogen H pe 1 0.529 x 10-8 13.595 
Positronium Ps ete~ 1/2 1.05 x 10-8 6.80 
Muonium Mu pte- 0.995 | 0.532 x 10-8 13.539 
M-Mesohydrogen | — Pu” 185.8 2.85 x 10-11 — 
n-Mesohydrogen | — pn  |237.8 2.23 x 10-1 — 


masses the latter is 920 times heavier. Because of the differ- 
ence in reduced masses the ionization potential Vion 
(i.e. the binding energy) of positronium is half that for the 
hydrogen atom, and the distance r between the electron and 
the positron is twice that between the proton and the elec- 
tron (twice the Bohr radius). Although positronium con- 
sists of stable particles (the electron and positron are 
particle and antiparticle), they can kill each other (anni- 
hilate) and give off y-quanta. This process is responsible for 
the instability of the positronium, and for its extremely 
short lifetimes (~10-7 and 10-!° s in a vacuum). 

Although muonium and atomic hydrogen differ markedly 
in mass, their reduced masses and hence some other charac- 
teristics are similar. 

The main feature of mesohydrogen is that the radius of 
the meson orbit is about 1/200 of the Bohr radius. There- 
fore, the positive charge of the proton is highly screened 
and in many respects mesohydrogen behaves as a neutral 
particle similar to neutron. 


3. Positrons and Positronium 


At present the bibliography of this expanding area of 
research [1] includes about 2,000 publications. The details 
of the theories and experimental techniques are discussed 
in books [2, 3] and in a number of review articles [4-12], 
including those devoted to the general overview of new 
atoms [10]. Recently the 6th and 7th International Confer- 
ences on Positron Annihilation were held [13, 14]. Experi- 
mental data are summarized in tables [15, 16]. 
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This review does not claim to be a profound analysis of 
the current work. Its objective is to give a general idea of 
new atom research and of applications of this research. 


3.1 Annihilation and Interaction 
of Positrons with Matter 


As a positron and an electron interact, their kinetic energies 
and rest masses transform into y-quanta. This process is 
called annihilation and is governed by general physical laws 
of conservation (of energy, momentum, parity, etc.). 

By the law of parity conservation in annihilation two 
or three y-quanta are produced depending on the relative 
orientation of the spins of the electron and the positron 
(for parallel spins three quanta, and for antiparallel spins 
two quanta). 

In the case of a two-photon annihilation of an electron- 
positron pair at rest the y-quanta fly off in strictly opposite 
directions (at 180°) in accordance with the law of momentum 
conservation. The energy of each quantum is equivalent to 
the electron rest mass (0.5 MeV). Slight deviations from 
0.5 MeV of the photon energies or from collinearity of the 
emission directions give valuable information about the 
properties of matter and are the core of a number of experi- 
mental techniques. 

The fate of energetic positrons emitted by radioactive 
sources (e.g. ?Na) is annihilation. But before this really 
happens some processes may occur, which are shown in 
Fig. 1. In a condensed phase positrons are slowed down in 
only several picoseconds (mainly by ionization and excita- 
tion of surrounding molecules). A fraction of positrons 
undergo free annihilation with surrounding electrons. In 
liquids this process may be preceded by solvation. Some 
positrons may get involved in chemical reactions and further 
annihilate from the bound state Me+ with a molecule or ion 
M. In the case of free annihilation a two-photon event is 
372 times more probable than a three-photon event. And 
Dr the free annihilation of positrons the lifetime is about 
0.4 ns. 

The remaining positrons form a bound positron-electron 
systems called positronium (Ps). Depending on the relative 
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Fig. 1 Scheme of positron processes 


alignment of spins of the electron and the positron, positron- 
ium can have two states: singlet or triplet. The properties 
of the positronium atoms are tabulated in Table 3. It is 


Table 3 Properties of the Positronium Atom 
Type of State 


Property 


Singlet Triplet 
Symbol sps iPs 
Spin state tt 4 
Quantum number J=0 J= 
Share, % 25 75 
Proper lifetime, s 1.25 x 10-10 1.4 X 1077 
Annihilation type 2y 3y 


seen that the proper lifetime (in a vacuum) of a triplet posi- 
tronium (3y-annihilation) is three orders of magnitude 
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longer than for a singlet positronium. But a variety of 
interactions, shown in Fig. 1, shorten the lifetime of posi- 
tronium in a condensed phase down to several nanoseconds. 
In positronium-forming systems the free annihilation of 
some of triplet positronium increases the share of 3y-anni- 
hilation as compared with the case of free annihilation 
of positrons. 

Measurements of lifetimes of triplet positronium and the 
angular distribution of the annihilation radiation of singlet 
positronium provide an insight into positronium reactions 
and the chemical properties of the environment. These 
studies are the main part of positronium chemistry. 


3.2 Positronium Formation 


As is seen in Fig. 1, not all positrons (and not in all substances) 
form positronium. Experimentally, this share can be 
estimated, for example, by measuring the intensity of the 
positronium-related long-lived component. Now there are 
two views of the process of positronium formation, although 
their combination seems to give the most complete picture 
of the phenomenon. 

By the model of the Swedish physicist Ore [17] positronium 
can form within a narrow energy interval between the ioniza- 
tion potential of the surrounding molecules, Vion, and 
(Vion — 6.8) eV, where 6.8 eV is the binding energy for 
positronium (half a rydberg). If we take into account the 
excitation of the first electron level E, of surrounding mole- 
cules, a process that competes with positronium formation, 
the interval will become narrower; and eventually for 
a positron that effectively forms a positronium, the energy 
interval will be E, < Eet < Vion — 6.8, and the propor- 
tion of positrons that form positronium, P, will be given by 





6.8 Eı— (Vion — 6.8) 
Viggen Ey (3) 


Positronium here can have a kinetic energy from zero to 
Vion or to Æ. 

The model only gives a correct prediction of positronium 
formation in inert gases. For multiatomic gases and especial- 
ly for liquids the assumption of fast moderation or epither- 
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mal positronium reactions are generally applied. But there 
are no accurate data on reaction thresholds and inelastic 
moderation cross-sections for positronium in liquids, and so 
any predictions before experiment are difficult. 

Another, so-called spur, model was first suggested by 
Mogensen [18] and developed by other authors [19-21]. This 
model better explains the evidence for positronium forma- 
tion in liquids. The underlying idea of the spur model is the 
assumption that positronium forms when a positron at the 
end of its moderation combines with one of the electrons of 
its own spur. By spur is here meant the unstable cluster of 
electrons, positive ions and radicals that emerge as a positron 
is moderated in a liquid and perhaps in some solids. 

‘The probability of positronium formation in the positron 
spur depends mainly on the rates of the competing processes. 
In the general form the process can be represented as 
follows: 


ete 5 Ps, (4) 
e4 M+ “3 Mr, (5) 
e +S, i products, (6) 
M* +S, z products, (7) 
eps, 5 St. (8) 


In pure solvent, obviously, only reactions (4) and (5) should 
be considered. In the presence of active impurities 5,, S3, S, 
the processes (6)-(8) also play a part. They lead either to 
inhibition or an increase in the percentage of positrons that 
form positronium. The rates of the processes (5)-(7), and 
hence the probability of positronium formation, are quite 
sensitive to the physico-chemical properties of the environ- 
ment and they are measured in radiation chemistry. No won- 
der, therefore, that a number of interesting correlations have 
been found between positronium chemistry and radiation 
chemistry [22-29]. Relative contributions of the two mecha- 
nisms have not yet been determined; they of course depend 
on the properties of the system in question. Much work is 
being done currently in this direction (modelling [30, 31], 
studies of the influence of an electric field on positronium 
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formation [32, 33]). Recently a new technique of single- 
photon count with radioactive (including positron) sources 
[35] has appeared, which allowed observations of the 
correlation of inhibition of the recombination fluorescence 
intensity and positronium formation by the same accep- 
tors [36]. The presence of this correlation is evidence of the 
predominance of spur processes in positronium formation 
in liquids. In solids positronium observations are sensitive to 
the presence and characteristics of the so-called free vol- 
ume [37]. 


3.3 Positron Annihilation Experiments 


Information about the fate of positrons and positronium and 
about the properties of their environment can be obtained 
from examining the characteristics of the annihilation 
radiation emitted in positron destruction. Each of the 
techniques available is aimed at investigating certain 
characteristics: 

(1) The number of y-quanta emitted in annihilation. The 
corresponding method of double or triple coincidences is the 
least informative. 

(2) The angular distribution of annihilation radiation. 
Here information is derived about the momentum distribu- 
tion of electrons involved in annihilations. If positrons or 
positronium take part in chemical reactions, the shape of 
the distribution changes. 

(3) The energy spectrum of annihilation y-quanta (devia- 
tion from 0.5 MeV registered using a Ge(Li)-semiconductor 
detector) also represents the energy distribution of electrons 
involved in annihilation. 

(4) The time distribution of annihilation radiation. It is 
strongly dependent on electron density at the location of 
a positron or positronium and on the rate at which posi- 
tronium enters into chemical reactions. 

Various ß*-active isotopes can be used as positron sources. 
Of these ??Na is the commonest both due to its rather long 
half-life (2.58 years) and to its applicability in various 
experimental arrangements. The decay scheme for **Na 
is shown in Fig. 2. Depending on the technique used, sources 
can have different activities which vary from tens of micro- 
curie for positron lifetime measurements and studies of the 
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x E, MeV 








be OE 0 Fig. 2 The decay scheme of the 
22Ne positron-active 22Na 


Doppler broadening of the annihilation line to a hundred mil- 
licurie in angular correlation measurements. In the first case 
activities are limited by the growth of the random coincidence 
background. A source is prepared by evaporating a drop of 
22NaC] radioactive solution on a thin (1-2 mg/cm?) metal 
foil or synthetic film that pass positrons well, the source is 
then covered with the same material (foil or film). When 
working with liquids it is also convenient to use sources 
prepared by diffusing at an elevated temperature of ?*NaCl 
into 2-4 mg/cm? glass foil [38]. Another positron source can 
be a positron beam of certain energy produced at accelerators 
[38, 39]. The last technique is commonly used in studies of 
the chemical state and physical properties of surfaces of 
solid specimens, and also in studies of the depth distribution 
of defects [40, 44]. 

The common experimental arrangements are depicted in 
Fig. 3. Shown in Fig. 3a is the triple coincidence arrange- 
ment to record orthopositronium annihilation events with 
the emission of three y-quanta. They are recorded by coun- 
ters C,, Ca and C}. The technique has the disadvantage that 
the counting rate for triple y-coincidences (C;,) is propor- 
tional to the product of two variables—the percentage of 
positrons, P, that form positronium and the probability of 
an annihilation with the ejection of three y-quanta W,, = 
Tath (Ta is the real lifetime of positronium in matter, 
and tp is the lifetime of triple positronium in a vacuum 
(see Table 3)). Therefore, C,, can go down both because of 
shorter positronium lifetime (smaller W ,,) and because of 
positronium formation inhibition (smaller P). It is desir- 
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Fig. 3 Experimental arrangements: (a) triple coincidence circuit; 
(b) observation of time spectrum of positron annihilation; (c) observa- 
tion of correlation curves 


able, however, to distinguish the contributions of these two 
interactions. 

The most variegated information is supplied by observa- 
tions of the time spectrum of positron annihilation (see 
Fig. 35) using coincidence circuits that record delayed coin- 
cidences N (t) between a nuclear y-quantum with energy 
1.3 MeV (counter(,) emitted essentially simultaneously with 
the ejection of a positron from a Na nucleus, and a 0.5- 
MeV y-quantum (counter (,) produced by a 2y-annihilation 
of that positron. Thus, the times of creation and destruction 
of that positron are recorded. The short-lived component 
of the time spectrum (lifetime t,, intensity /,) is due to the 
annihilation of positrons in parapositronium (‘Ps, see Ta- 
ble 3) and in free collisions with electrons; whereas the long- 
lived component (lifetime t,, intensity J, = °/, P) is due to 
the annihilation of the positrons that have formed triplet 
positronium. The resolving time (the width of the prompt 
coincidence curve at half maxima) for present- day apparatus 
is about 0.3 ns. 
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This technique distinctly distinguishes positronium 
quenching and positronium formation inhibition. In the sim- 
plest case of ‘pure’ inhibition the intensity 7, of the long- 
lived component decreases without decreasing the slope of 
the ‘tail’ of this component that characterizes its lifetime T.. 
But with pure quenching, t, decreases but J, remains un- 
changed. In what follows we will encounter examples of 
changes in time spectra of annihilation due to inhibition and 
quenching. Strictly speaking, when the lifetime of positro- 
nium in a solution becomes shorter (quenching) due to the 
use of active additives, kinetic equations [42] determine 
the relation between the percentage of positrons P that form 
positronium in the solution and the observed intensity of 
the long-lived component /,: 

_ 3 Mp — Ap 
heor ipl e ' (9) 


where A, is the rate of annihilation at solvent molecules, 
the so-called pick-off annihilation; A, is the rate of annihila- 
tion of positronium in bound state with active additive 
(An = 1/th, m< 0.4 ns, see Fig. 1); Ag is the positronium 
quenching rate given by 1/t, = Ap + Ag = àp + K [Ac]. 

The single-photon count technique with positron sources 
[35, 36], mentioned in Sec. 3.2, is in fact a modification of 
apparatus designed to study positron lifetimes in which the 
annihilation y-quantum detector is replaced by a single- 
photon one. 

Figure 3c represents a third major technique—the obser- 
vation of correlation curves, i.e. dependences of coincidence 
counting rate of two annihilation y-quanta (C,,) with energy 
0.5 MeV (C, and C,) on the angle 9, i.e. on the deviation from 
180° of the angle between two ejected quanta. The detectors 
are located at about 2 m from the sample behind lead colli- 
mators. Differentiating dC,,/d@ gives the momentum distri- 
bution function for electrons in matter p (P,). The narrowest 
component of the complicated correlation curve corresponds 
to singlet positronium annihilation. Therefore, by separating 
in a large experimental run the correlation curves shown 
schematically in Fig. 3c into the narrow Jy and wide compo- 
nents we can obtain important additional information about 
positronium chemical reactions, namely we can distin- 
guish the various positronium quenching forms. Actually, 
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the annihilation of orthopositronium ('Ps) on ‘foreign’ 
electrons in collisions, just as the formation of chemical 
compounds PsM in substitution and addition reactions, and 
the annihilation of orthopositronium oxidation products— 
free positrons— yield correlation curves several milliradi- 
ans wide. There are special computer programs to distin- 
guish and identify both the case of addition and the case of 
oxidation [43, 44]. Positronium conversion from a triplet 
state into a singlet one with subsequent 2y-annihilation in- 
creases in the correlation curve the intensity of the narrow 
component and can also be easily identified. 

The coming of Ge(Li)-semiconductor detectors (SD) 
with high energy resolutions (1 keV at 500 keV) opened up 
new possibilities to do without ‘angle’ installations. In fact, 
if X is the direction from a positron source to a detector, 
then since 
the electron energy E (positron is assumed to be at rest), 
the momentum component P, and the annihilation line 
broadening AE, will be related by 


E = P?/2m, = 2 (AEy)*/m,c?. (10) 


Thus, the Doppler shift of a y-quantum by 1 keV, i.e. only 
by 0.2%, corresponds to an electron energy of 3.9 eV. Since 
under experimental conditions 


sin 0 = P,/m.c ~ 8 (11) 


(Z is the direction normal to the plane passing through the 
collimator slit of the stationary detector and through the 
sample—see Fig. 3c), then in isotropic case 


0 = 2AE,/m,c?. (12) 


Here a 1-keV broadening is equivalent to a deviation of 
3.9 mrad from collinearity in quantum ejection. In other 
words, a detector with a high (in modern terms) energy res- 
olution is equivalent to an angular installation with a reso- 
lution of about 4 mrad, a resolution four times worse than in 
a conventional angular experiment. Nevertheless special 
computer programs take into account resolution functions 
and make it possible to extract the information required 
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[45]. With semiconductor detectors only one of them is re- 
quired. Also, installations are more compact and the source 
is less active. 


3.4 Positronium Quenching 


The lifetime of orthopositronium in liquids and solids 
(0.6-5 ns) appears to be much shorter than its proper lifetime 
(140 ns, see Table 3). This is due to pick-off annihilation, 
i.e. the annihilation of positronium’s positron on electrons of 
surrounding molecules. The lifetime of positronium becomes 
yet more shorter in the presence of small amounts of positro- 
nium-active molecules (quenchers). Positronium is quenched 
through its chemical transformations. To begin with, we 
will look at pick-off annihilation. 

Pick-off annihilation. Bubble formation. ‘Free volume’ 
model. This subsection is concerned with two main hypothe- 
ses of the pick-off annihilation of positronium. For liquids 
this is the so-called bubble model [46, 47], which has first been 
formulated for condensed gases [48-50]; it is also of impor- 
tance for an understanding of the mechanism of positronium 
chemical reactions. For solids, e.g. polymers, the ‘free vol- 
ume’ model is suggested [51]. 

The bubble model assumes that in the majority of liquids 
around positronium a bubble is formed [52-57]. The bubble 
size is controlled by the condition that the exchange repul- 
sion forces and the surface tension are at equilibrium. The 
specific rate of pick-off annihilation is given by 


Ap/ N = arcZuP(kR), (13) 


where for Zep we take the total number of valence electrons 
in a molecule; P (kR) = sin?(kR)(1 — kR cot kR) is 
the probability that positronium leaves the potential well 
of the bubble of depth u and radius R; k = (4m,E/h?)'/?; 
E is the positronium energy reckoned from the well bottom. 
From the minimum condition of the positronium potential 
energy in a liquid and the boundary conditions in a bubble 
follows the mathematical criterion for bubble formation 
around positronium [52]. It turns out that for the majority 
of liquids R = 345 A, ux 1 eV. 

Equation (13) easily explains the observed dependence of 
the rate of pick-off annihilation of positronium on the num- 
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ber of links in the chain of aliphatic hydrocarbons and their 
derivatives [9, 53, 58] (Fig. 4); and this equation also deter- 
mines the dependence of the rate on such characteristics of 
liquids as parachor [55], surface tension [54], and solubility 
parameter [59]. 

The considerations just described make it possible to ex- 
amine clustering in binary liquid mixtures [60]. 

When positronium is formed in solids, e.g. glasses, poly- 
mers or ice, it becomes impossible to push molecules aside 
to make room for a bubble. Under these conditions positro- 
nium can localize in microvoids, called ‘free volumes’. 
Annihilation with possible captures of positronium in free 
volumes was first considered by Brandt [61, 62]. The method 
offers new possibilities for investigation into the structure 
of a number of materials, including polymers [64-70]. Ac- 
cording to the results [37, 61-71] the lifetime distributions 
of positron annihilation in solid molecular systems have more 
than two components and for amorphous structures they 
have the following nature: t, is the component associated 
with annihilation of nonlocalized atoms of triplet and singlet 
positronium, and also of localized atoms of singlet positro- 
nium; T, is the component of annihilation of quasifree posi- 
trons or positronium localized in crystalline regions of a 
polymer [69]; t, is the component of annihilation of localized 
(captured in microvoids) atoms of triplet positronium. 

Relation of pick-off annihilation with the structure of some 
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polymer systems. In the previous subsection we mentioned 
the possibility of using pick-off annihilation to probe into 
the structure of solutions. In polymer systems, especially 
polycrystalline ones, or in filled materials the lifetime spec- 
trum of annihilation radiation looks especially complicat- 
ed. In these cases, however, special-purpose programs 
[75, 76] are used to separate the main annihilation processes 
and to derive interesting information about the structure 
of a material. By way of example we consider an investiga- 
tion of the influence of the surface properties of a filler 
(Aerosil) on the structure and properties of ethylene-propyl- 
ene vulcanized rubber [70]. 

Two cases were compared, inone of which hydroxyl groups 
were removed from the surface of filler particles. This 
prevented the surface from interacting with a polymer mat- 
rix. Of especial interest were the variations of the parame- 
ters of the intermediate component /,, T, at low concentra- 
tions of the filler (about 10% mass) (Fig. 5). Chemical inter- 
action of a polymer with the filler surface results in compac- 
tion of the polymer near grains, and formation of a compact- 
ed layer. This manifests itself through a shortening of 
Tə which is defined as the averaged lifetime of the inter- 
mediate component for the entire polymer matrix. At high 
filler concentrations T, stops decreasing and begins to tend 
to the limit, which is due to the annihilation of ever increas- 
ing proportion of positrons on the surface defects of filler 
grains. Prevention of chemical interaction of the polymer 
with the filler surface, on the contrary, reduces the density 
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of the polymer near the surface of grains. And so we end up 
with conflicting variations of t. From experimental results 
one can deduce the diffusion coefficient for positronium in 
vulcanized rubber D = (2.0+0.7) x 10-% cm?/s, and also 
the effective thickness of the reorganized surface layer of 


the polymer (about 300 A). It should be emphasized that it 
is the annihilation characteristics /,, T,, not a macrocharac- 
teristic such as the density p of the sample, that correlate 
with the rupture stress o of the resultant samples (see Fig. 
5). It follows that the positron method is suitable to probe 
into the polymer structure. 

Another evidence favoring the conclusion just made 
comes from separate studies of the oxidation kinelics of poly- 
4-methyl-pentene (PMP) in crystalline and amorphous 
phases of the same sample [74]. Oxidation occurred in 
both phases but with different rates. It was accompanied 
with the formation of hydroperoxide and, obviously, of new 
defects (free volumes). As in the first case the time spectra 
were decomposed into three components. Tao and Green 
[77] observed a correlation between the intensity J, of the 
intermediate component and the crystallinity of Teflon. This 
suggests that in polycrystalline polymers the component 
I, is associated with annihilation in the crystalline part of 
the polymer. The variation of J, and J, (especially the de- 
cline) with temperature of the PMP sample (see Fig. 6, where 
the variation of J, is shown) can only be accounted for 
by assuming that the percentage of positrons that form posi- 
tronium in crystalline regions changes. In fact, the emerging 
defects, on reaching certain concentrations, can capture 
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positrons or electrons and thus hamper the formation of posi- 
tronium by spur mechanism (see Sec. 3.2). 

From J,/I}, I/I} there were found separately the oxida- 
tion levels X for the crystalline and amorphous phases, and 
also the activation energies for respective processes: E, = 
16.9+ 2.5, E, = 22.2+0.9 kcal-mol-! (the dash line 
in Fig. 6 is plotted using E.). It is of interest that in agree- 
ment with these data is the average (for both phases) acti- 
vation energy for oxidation, found from peroxide release. 

The positron testing of polymers yielded other interesting 
evidence as well. It was shown, for example, that structural 
transformations as a result of plastic deformation of amor- 
phous polymer systems are in many respects similar to those 
characteristic of polycrystalline solids. In the initial stage 
(small strains) of transformations, in ordered regions (which 
in this case appear as microcrystallites) there develop dis- 
placement defects, and asstrain increases they migrate and 
reach the surface in these regions [72]. 

It became possible to study the details of the microstruc- 
ture of polymers that are responsible for the permeability 
of polymers for some light gases, and also the solubility of 
these gases in polymers [73]. 

In filled polymer materials annihilation characteristics 
were observed and from them effective radii and density of 
defects in the amorphous and crystalline regions were calcu- 
lated to be R = 3-6 A and N = 1018-10% cm-%, respectively. 
This enabled one to account for the nature of periodic vari- 
ations, as the filling percentage increases, of some properties 
of these materials (loss tangent and its maxima T, and T, 
as the temperature of the sample varies; melting point; and 
the real part of the modulus of elasticity E) [74]. 

Positronium chemical reactions (classification, general 
scheme). Further shortening of positronium lifetimes in various 
media (as compared with pick-off annihilation) under the 
action of tiny amounts of active additives is caused by posi- 
tronium chemical reactions. Up until the present essentially 
all types of positronium reactions have been observed and 
studied [47]. For example, oxidation: 


Ps- Fest > Fettet, (14) 
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positronium conversion from triplet to singlet state on pa- 
ramagnetic ions, molecules or electrons of a positron spur 


[78]: 


‘Ps(4$t)-+NO(}) > sPs(t})+NO(f), (15) 
substitution: 
Ps+1,%-> (PsI)+I, (16) 
my 


annihilation from bound state (addition): 
Ps-+(CF3),NO- — [Ps(CF3);NO-}, (17) 


the donor-acceptor interaction in organic systems contain- 
ing carbonium-ion-like structures M(*C:): 
te 
Ps+ M(*C : )Z7[PsM(*C : )], (18) 


——> 2y 


and various combinations of thesegtypes,of reactions. 

Thus the definition of positronium as the simplest free 
radical, a hydrogen-like atom, is justified. On the other 
hand, because the mass of, positronium is sojsmall it be- 
haves in a specific manner in chemical reactions. 

(1) Comparison of interactions of positronium, electrons 
and atomic hydrogen with halogens (Cla, Brg, I,) in gaseous 
and liquid phases showed [79] that the nearest analogue of 
the ‘hot’ (i.e. fast) positronium reaction is the reaction 
of dissociative electron capture. For thermal positronium in 
a gas this conclusion is less obvious. In liquids continuous 
interaction of positronium with solvent molecules eventual- 
ly results in its being localized in voids formed by the ther- 
mal motion of molecules, or in the formation of a bubble 
around a positronium atom. The effective mass of positro- 
nium increases. This strengthens the analogy with the inter- 
action of heavy radicals or solvated electrons. In fact, it 
is common knowledge [3, 80] that the majority of the strong- 
est ion-oxidants of atomic hydrogen (oxidation potential 
0.15 < HE, < 0.6 V) are at the same time strong quenchers 
of positronium. Conversely, ion-reducers, which possess 
a negative electron affinity, ‘quench’ positronium only when 
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they are paramagnetic (see (15)), i.e. there occurs conver- 
sion. For organic compounds a similar characteristic is the 
polarographic potential of half-wave Et? [47]. 

The idea that positronium can be represented as the sim- 
plest of free radicals was supported [81, 82] by observations 
of a correlation of the reactivity of positronium in relation 
to a number of aromatic compounds with the Hammett 
constants. Experimental [83, 84] and theoretical [85] evidence 
indicates that the conversion interaction of positronium and 
the exchange broadening of EPR lines of stable radicals in 
interaction with paramagnetic complexes are similar. There- 
fore, positronium can be employed as a useful standard to 
study such interactions. Just like other free radicals, posi- 
tronium is sensitive to the composition of the first coordina- 
tion sphere of ion-acceptors in a solution. It is even more 
sensitive than conventional radicals, the mobility of posi- 
tronium being much higher. 

(2) In a liquid positronium reacts in the diffusion-kinetic 
region, i.e. the observed reaction rate of positronium is in- 
fluenced both by the velocity of approaching an acceptor 
(which in turn is conditioned by viscosity) and by the chemi- 
cal properties of the acceptor [86]. The observed reaction rate 
constant is computed from experimental data (see Sec. 
3.3) by Kop = (4/tp — 1/7,)/[Ac], where tp and t, are the 
observed lifetimes of positronium in the solvent and solu- 
tion, respectively, and [Ac] is the acceptor concentration. 
In the general form the constant can be written as 


K o» = -fr Ko='Ko, (19) 
where w is the probability of chemical reaction per collision 
of agent with acceptor in a cell; x is the probability for 
the agent or acceptor (or both) to leave the cell in between 
the collisions; Kp is the constant of interdiffusion rate for 
the reactants. 

Although some authors attach less importance to the diffu- 
sion stage of the process [87, 88], the entire body of avail- 
able evidence concerned with positronium chemistry can 
only be explained taking into consideration both stages of 
reactions, i.e. using a dependence of the type (19). 

(3) A great part in positronium chemical reactions is 
played by the formation of the intermediate complex [PsAc], 
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Fig. 7 Scheme of positronium chemical reactions 


whose stability depends on the electron affinity of acceptors. 
Positronium interactions have been classified [47] depending 
on this characteristic of acceptors. The role of the complex 
in positronium reactions has been borne out experimentally 
when the influence of temperature on positronium reactions 
with diamagnetic organic molecules was investigated [86]. 
As the solution temperature increased complex decomposi- 
tion was becoming ever more important, which showed up 
as a characteristic (with a maximum) temperature depend- 
ence of the rate of interaction of positronium with molecules. 
In this context it was also useful to determine in angular 
correlation experiments the angular distribution components 
of annihilation radiation that are directly concerned with 
positronium annihilation from the [PsAc] complex [89, 90]. 

(4) For a given electron affinity of the acceptor the stabili- 
ty of the [PsAc] complex in gaseous phase is determined by 
the possibility to release excess energy in a three-particle 
collision with a molecule of background gas (Ps-Ac-M) or 
in interactions with the ‘wall’ of a pore of a fine-grained sys- 
tem [91]. Reactions in liquid phase are dominated by the 
collapse of a bubble formed around positronium (see above) 
at the moment when positronium is captured into the com- 
plex. The general scheme of positronium reactions with an 
account of the diffusion stage of the process, the formation 
of a complex and the possibility of its thermal decomposition 
due to bubble collapse is described in [47, 92], and also il- 
lustrated in Fig. 7. A complex is destroyed by the disturbing 
action of collapse (instability) if an acceptor has not too 
high an electron affinity (—0.8 < E12 < —0.48 V at 
pH = 7) and interaction is essentially multiple captures 
and releases of positronium from the complex, 
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Comparison of calculations with measurements shows that 
the velocity of the microbubble wall motion can be described 
by such macroscopic characteristics as surface tension o 
and viscosity n: v = o/2n. We define the rate of bubble col- 
lapse as 


Acor Zen (0/2n) (Rou z Raec)™t, (20) 


where Rpu» is the equilibrium radius of the bubble; Rae, 
is the radius at which the complex decomposes. Using (19) 
we can obtain the temperature dependence of the observed 
rate constant of the positronium reaction. For an ‘unstable’ 
complex 


BT T A E 1 
Kax ft KZ [1 He g exp (-2)]}". 
(21) 
Here B, qı, qz are constants, A, is the rate of annihilation of 
positronium in the complex [PsAc]; Ep = Ecap — Egecs 


where Ecap and Eye, are the activation energies of positron 
capture in the complex and of its decomposition, respec- 





tively. 
For a ‘stable’ complex 
REL 7 (140,7 esp (Zee )]', By > RT. (2) 


(5) The Een of tunnel effects at each of the above- 
mentioned stages of reactions in a liquid of such a light par- 
ticle as positronium was the subject of some works [88, 
93-95]. If we take into account the possible role of tunnel- 
ling at the diffusion stage (Kp in (19)) and at the reaction 
stage in the common cell of the solution (w in (19)), we can 
represent the interaction of positronium and acceptor as 
shown in Fig. 8 [93-95]. It includes the approach of agents 


Ac” + et (SPs + Ac (t)) 





Fig. 8 Chemical interaction between positronium and acceptor with 
account of tunnelling in the diffusion stage and in the stage of reaction 
in the common solution cell [93-95] 
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and their encounter in the common cell of the solution (Kp), 
the formation of a complex and the reaction itself (oxida- 
tion, annihilation from a bound state, conversion). Possible 
occurrences of quantum effects are shown in Fig. 8 by a dash 
line (the transfer of an electron e-, spin-flip exchange (tJ) 
or the transfer of positronium as a whole into a cell or a com- 
plex). 

Tunnelling at the diffusion stage was identified [94, 95] 
by slowing down the stage in water-glycerin solutions of 
high viscosity (up to 466 mPa-s) and measuring the effective 
rate constant for interaction of positronium with ‘converter’ 
FeCl, and strong positronium oxidizer K,Cr,O,. Water- 
glycerin mixtures were taken because a number of special 
experiments [59, 60, 96-99] have shown that no bubble is 
formed here, and so the influence of the diffusion stage can 
be studied in the pure form without the complicating effects 
of collapse. 

As was to be expected for diffusion-controlled reactions 
experiment yielded a linear dependence of the observed in- 
teraction rate on the reciprocal viscosity (Fig. 9). Assuming 
that for a strong oxidant the reaction occurs at a first colli- 
sion (£ = 1) and R = 5A, we can find D in a given viscosi- 
ty interval: 0.9 x 10-°-5 x 10-5 cm?/s. This is an order of 
magnitude higher than the diffusion coefficient of convention- 
al ions in a solution. Therefore, for positronium reactions 
the total diffusion coefficient is determined by positronium 
diffusion and it can be considered the same for reactions with 
all ions, including FeCl,. 
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Fig. 10 Variations of (1) effec- 
tive radius of the reaction of po- 
sitronium with FeCl, with the 
positronium diffusion coefficient 
(dots—according to equation 
(19), the solid lines—according 
to (23)) and (2) of Kop with 
i/y (the dash line is plotted con- 
sidering the influence of viscos- 
ity on the probability of spin 
3 4 5 exchange; the dash-dot line is 

Dx105,cm2/s the linear law) 





But for FeCl,, Ko» shows some deviations from linearity 
for n >40 mPa-s (Fig. 10). This deviation can be explained 
either by the influence of viscosity on the probability of spin 
exchange due to a change in the average time the colliding 
particles are in contact, or by the contribution of quantum- 
mechanical effects of tunnelling to a cell with a ‘converter’, 
or by spin exchange at a distance. The first effect shows up 
as the tending of Ko» to the limit at low viscosities (dash 
line), the other two as an increase in the effective radius of 
interaction at high viscosities (low diffusion coefficients). 
It is seen that the first point is not supported by experi- 
ment. Radii of interaction of positronium and FeCl,, com- 
puted by the formula Ret == Kop/4nD, are provided in Fig. 
10 (for low viscosities Rep = 4.5 A). For high viscosities 
Ret grows indeed and can be described quantitatively [100]. 

The dependence R,;(D) is given by [95] 


1 K 1 
Bam Rt ne) BES |= Rtr e 


where zy = poi y = 1.781 (Euler constant); K, (x), 


T, (x) are respectively the zero-order MacDonald and Bessel 
functions of imaginary argument; Re isthe radius of the com- 
plex. The parameter 1/x ~ 3 A is easily found as the tan- 
gent of the slope of the plot Ry, = f (ln D). Knowing 
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Ren D and 1/a, we can find the second parameter: vy = 
3.5 x 1010 s-t, 

From these values we can estimate for the tunnelling of a 
particle of mass m the initial binding energy E from the re- 
lationship E = h?a?/2m. At m = mp we have E= 
0.2 eV, which is close to the binding energy for positro- 
nium at a localized state [47], but much lower than what 
might be expected for the transfer of one electron torn away 
from positronium (oxidation) [101]. This illustrates the con- 
tribution of tunnelling to conversion of positronium on FeCl, 
in the stage of diffusion approach. 

The appearance of diffusion stage in positronium reactions 
is important because this confirms the above explanation of 
specific (with a maximum) temperature dependences of reac- 
tion rates for positronium and diamagnetic organic mole- 
cules. To ignore this stage and to account for the effect by 
the tunnelling transfer of charge during chemical interac- 
tion would be wrong here also because further experiments 
in these systems [89, 90] showed that positronium annihila- 
tes as a whole from the bound state [PsAc]. Tunnelling in 
diffusion stage is but weakly sensitive to temperature [95]. 

Studies of physico-chemical properties of positronium envi- 
ronment. Applications of the positron method to study the 
physico-chemical properties of the environment are discussed 
in a number of review articles [9, 80, 102]. They are devoted 
to applications of the method, some of them dealing with 
the reactivity of complexed ions of transition metals in re- 
lation to both oxidation and spin exchange and the identifi- 
cation of the contribution of quantum-mechanical effects to 
reactions of simple radicals. 

(1) In recent years it has been found that positronium inter- 
acts intensively with diamagnetic organic molecules (aro- 
matic compounds or other conjugate structures). It turned 
out that the activity of these molecules in relation to positro- 
nium is substantially dependent on the distribution of elec- 
tron density in them. The latter can be changed by intro- 
ducing donor and acceptor substituents (—NH,, —OH, 
—Hal, —NO,). In relation to positronium those molecules 
are active in whose structure carbonium-ion (+C:)—triva- 
lent positively charged carbon—is represented with a sub- 
stantial weight. The technique, thus, enables such struc- 
tures to be determined. 
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The sensitivity of positronium to redistribution of elec- 
tron density in a molecule allows one to measure the conduc- 
tivity of organic semiconductors at the level of one poly- 
mer molecule [80]. Clearly, this gives a more accurate indi- 
cation of the properties of a semiconductor of a given chemi- 
cal composition than conductivity measurements on bulk 
polymer. The technique is based on the fact that the rate 
constant for the interaction of positronium with an organic 
molecule increases if the latter includes a strong electronic 
zy 
No- 
and decreases if the action of this group is compensated 
for by the influence of an aminogroup (p-nitroaniline 


acceptor such as a nitrogroup (nitrobenzene ic (OÅ 


+ o 
H,N N ). The compensating action can be reduced 
2 ge P g 


(rate constant increased) if the donor and acceptor groups are 
separated by bridges whose conductivity is estimated. 
(2) The positron annihilation technique has been used suc- 
cessfully to uncover structural transformations (phase 
transitions) in liquid crystals [103]. These crystals have one 
or more mesomorphic (nematic, smectic) phases in addition 
to the conventional liquid and solid phases. Liquid-crystal 
structure is at present assigned to cell membranes in biolo- 
gical objects. Phase transitions change substantially the 
transport properties of membranes. Therefore, an attempt 
was made to observe these transitions. In fact, in 4-butyl- 
hydroxybenzene-4-ethylaniline a new phase was discovered 
in addition to the four previously known ones [103]. Its 
existence was substantiated by the methodJof differential 
scanning calorimetry. In addition, ‘in aqueous dispersions 
of phospholipids the lifetime of positrons was observed to 
change sharply at the temperature of \gel-liquid crystal 
transition. Extensive studies of micellar solutions have been 
carried out in”recent'fyears by Ache and coworkers 
[104], who‘fmeasured positron lifetimes. It turned out 
that at the micellization point the long-lived positron com- 
ponent declines markedly [105]. The effect was observed re- 
gardless of the sign of the micelle charge in aqueous solu- 
tions of sodium dodecylsulphate ([CH,—(CH,),,—S0O,]-Nat) 
and cetyltrimethylammonium bromide ([C,,;H,,—CH,— 
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Fig. 11 Variation of theshare ofthe Fig. 12 Variation of o; and Av/v 
central part of annihilation spect- of the bond H— Hal for the mo- 
rum from the Ge(Li)-detector with lecular complexes H—Hal— 
the concentration of (1) sodium olefin with the ionization po- 
dodecylsulphate and (2) cetyltri- tential of the donor Ig: (4) CH3; 
methylammonium bromideinwater (2)C,H;; (3) C3H;Cl; (4) C3H5F 3: 

(5) CHa; (6) iso-C,H,; (a) 
HClson (b) HClua; (c) HBrsoi 


+ 
—N (CH,),]Br-) in angular correlation experiments using 
Ge(Li)-detector (Fig. 11) [105]. This points to an exceeding- 
ly high capture rate of positronium (but not positrons and 
electrons) by the surface of a micelle. 

(3) The positronium inhibition parameter o, can serve as 
a measure of strength of bonds in donor-acceptor complexes 
(e.g. olefin-hydrogen halide), especially in liquid phase 
where it is difficult to obtain IR-spectra. This follows from 
the observed correlation between o, and the relative fre- 
quency shift Av/v of the H—Hal bond depending on the ion- 
ization potential of donor—olefin J, (Fig. 12) [106]. 
The potential in this case is a measure of the electron-donor 
power of olefin. 

(4) Let us take a closer look at how the positron tech- 
nique allows an estimate of the electron affinity of catalytic 
(so-called proton or Bronstead) centres introduced into sys- 
tems with large specific surface, e.g. into Y-type zeolites 
[107]. This possibility relies on the fact that the rate of the 


reaction 
H+ (sur) + Ps > H + e+ + Ee (24) 


is determined by the binding energy of a proton on a sur- 
face, i.e. by the force of B-centre, and it increases with tem- 


170 V. I. Goldanskii and V. P. Shantarovich 


Count in channel 





Channel number (time) 


Fig. 13 Inhibition of the t,-component by B-centres in switching 
from NaY to HNaY at different temperatures: (1) 25 °C; (2) 80 °C; 
(3) 180 °C; (4, 5) t, at 80 and 180 °C, respectively; 1 channel = 3.3 ns 


perature. Accordingly, achange in the electron-acceptor power 
of proton centres can be pinned down from a change in 
quenching and inhibition of positronium. 

Shown in Fig. 13 is the inhibition of the T,-component by 
B-centres in a sample of zeolite NaY in the hydrogen form 
HNaY as the temperature of the sample is increased. The 
concentration of B-centres was controlled by IR-spectrosco- 
py. 

Assuming oxidation interaction of positronium with a 
B-centre, one can estimate the electron affinity for a B-cen- 
tre (Zpe-) in the system in question. The reaction is written 
as follows: 


-Ey+ 


-W 
H* (sur) + Ps — S Htp Ps —B Ht etter 


| 


| . | Wy (25) 
H (sur) +e ——— Hë. 


Here Wy and Wps are the binding energies for free hydrogen 
atoms and positronium, respectively; Zpn+ and Zu are the 
binding energies of the hydrogen ion and atom with a sur- 
face, respectively. The condition for the reaction to run in a 
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given direction is Epe- > Wps, and then, clearly, 
Wis S< Eg- < Wu. (26) 


The estimate can be improved by writing the enegy balance 
of the cycle (25) 


—Ey+ — Wps + Wu + En = — Wps + Esge- (27) 


Both sides are the energy released in the reaction and real- 
ized in the form of the kinetic energy of a positron 


p?l2m D (Wu — W ps) + (Eu = Em) = = Wps + Epe-. 


It follows from (27) that Ege- = Wy + Eu — Exe. Since 
Ey K Et (Eir 4.5 eV), then the electron affinity of a 
B-centre in the system in question is about 10 eV and it 
changes mainly when the binding energy for OH does so. 

The result obtained drastically enriches our understanding 
of B-centre structure of the Y zeolite. It turns out that a 
B-centre can be treated not only as a proton donor but also 
as an electron acceptor. 

(5) This last example of the uses of the positron techniques 
is concerned with inhibition of positronium formation (see 
Sec. 3.2) and the spur model of this effect. So a distinct cor- 
relation was found [108] between the relative values of rate 
constants for reactions of ‘dry’ (nonsolvated) electrons with 
a variety of acceptors in ethanol, propanol and water and 
the positronium yield in corresponding systems. And so, af- 
ter an appropriate normalization, from positron data one 
can obtain these constants for yet unstudied systems without 
sophisticated and expensive experiments on picosecond 
radiolysis. 

These possibilities can be expanded, namely, one can meas- 
ure absolute (not relative) rate constants of reactions in a 
positron (electron) track. This seems to be possible using a 
combination of measurements of positron reaction rate 
constants (see Sec. 3.5) and the data on suppression of fluo- 
rescence in solutions, which occurs when positrons interact 
with matter [35, 36, 109]; see Secs. 3.2 and 3.3. It was found 
that the mechanism responsible for the fluorescence of the 
solution irradiated by positrons is the recombination of track 
electrons with parent ions, Therefore, the suppression of 
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fluorescence and inhibition of positronium formation by the 
same acceptors, e.g. CCl, and C,H,NO,, inhibition of posi- 
tronium being the strongest when acceptors capture not on- 
ly electrons but also positrons (CC],), is convincing eviden- 
ce for the spur mechanism of positronium formation 


Fluorescence Ps formation 
+M* — ‘M* ——> hv +et — Ps 
7 ees 
Espur Espur (28) 
N LAc > Ac’, NAc > Ac’, 


e* + Ac — (Ace*) — 2y. 
Here *M* is the excited molecule of the fluorescent marker 
in singlet state. 

It turns out that for the acceptors that have the above fea- 
tures (e.g. CCl, and C,H,NO,) the relation of the cross-sec- 
tions of fluorescence suppression o, and positronium forma- 
tion inhibition o, can be expressed as 


Os abs ( Os ) ( kercen j 
Saves =a, VE) „ef Secu 29 
Oy Ae-M+ oO J CCl F ke-CCl4 ee 





where a55 and A.-m+ are the rates of positronium formation 
and e- and M+ recombination in spur; they depend on the 
properties of the solvent and fluorescent marker. 

Using the experimental data and the value ketca, = 
(8.32.7) x 10! 1-mol-!-s-!, found in positron ex- 
periments, we can estimate ke-cc1,=1.7 x 1010]. mol-t. s-t, 
which is close to the rate constant for the reaction of CCl, 
with a solvated electron [113]. Other constants of radiation 
processes can be found from a comparison of sensitization 
of some solutions by weak electron acceptors [36, 109] 
with the ‘antiinhibition’ of positronium formation [114]. 

A greater understanding of fast radiative processes was 
obtained through correlating positron annihilation data with 
the characteristics of the fluorescence they excite (specifical- 
ly with the optical detection of signals of magnetic reso- 
nance of radical pairs produced ina positron track [109- 
111]). For example, experiments on the inhibition and ‘an- 
tiinhibition’ of positronium formation in conjunction with 
the optical detection of magnetic resonance signals in squal- 
ane solutions of additives 1,2,4,5-C,H,Cl, and p-C,H,Cl, 
made it possible to reveal quite short-lived (10-30 ns) ani- 
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on-radicals formed as a result of the capture of track elec- 
trons, to estimate the cross-sections of the capture, and 
to study the influence of local magnetic fields in the track. 
Also were studied delicate track processes of ion-molecular 
charge transfer [111] and the formation of dimer cation-radi- 
cals [112]. 


3.5 Annihilation Without Positronium Formation 


As noted above (see Fig. 1), not all positrons form positro- 
nium in liquids and molecular media. We have already men- 
tioned this when we discussed the uses of positron techniques 
to study polymer materials. In a number of systems positro- 
nium formation does not take place or is artificially inhibit- 
ed or again peculiar positronium-like states, localized or 
delocalized, are formed [115-117]. Generally speaking, here 
we have another sphere, extensive and promising, whose 
detailed discussion however lies beyond the scope of this re- 
view. It can be called the physics and chemistry of slow pos- 
itrons. The issues of the physics of slow positrons and the 
uses of the positrons for probing into the structure of solids 
(metals, ionic crystals, semiconductors) are dealt with in a 
sea of original works and reviews; see e.g. [116]. The chemi- 
cal aspects, however, have not been discussed so extensively 
and so they deserve a brief comment. 

Positronium transformations covered in the previous sec- 
tions have been studied extensively during about 15 re- 
cent years. The same is true of the chemistry of the solvated 
electron ej, [118]. Up until recently, however, almost noth- 
ing was known [119] about the kinetics of reactions of the 
third lightest particle, the solvated positron (eig). But these 
data are of great interest both forthe further quantita- 
tive understanding of the spur mechanism of positronium for- 
mation and for a comparison with the solvated electron re- 
actions. 

The specific features of chemical transformations of 
eig» eig, Ps are concerned with the possibility of quantum- 
mechanical effects: zero oscillations, tunnelling, delocali- 
zation, and anomalously high mobility. It is well known, for 
example, that the electron mobility changes by several or- 
ders of magnitude from ethanol through hydrocarbons to 
liquid krypton [120]. Positronium possesses an anomalously 
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high mobility in frozen solutions [121-123]. It is clear that 
measurements of mobility of the only positively charged 
light particle (positron) can give some clues to the behavior 
of two other particles: the electron and the positronium. It 
is of especial importance for the electron, i.e. when it is 
difficult to measure the constants of interaction rates. 

One other aspect of studies is the possibility directly to 
test the applicability of quantum-mechanical calculation 
techniques with reference to bound systems (Met) by com- 
paring the results with experimental data. 

It is worth noting also that, as shown above, a knowledge 
of reaction rate constants for positron is of value, when it is 
desired to obtain radiation-chemical information from cor- 
relating data on positronium formation inhibition and fluo- 
rescence suppression. 

Eearlier the ‘resonance’ annihilation of positrons on mul- 
tiatomic molecules in gases was observed [124], the annihila- 
tion cross-section being 10-1000 times higher than Dirac’s. 
This was interpreted [125] as a sort of ‘resonance’ 
annihilation conditioned by the existence of polarized, vir- 
tual or weakly bound positron-molecule states. 

Kinetic studies of positron chemical reactions in liquids 
and gases are attended with the difficulty that in the majori- 
ty of these systems there forms positronium and it is fairly 
difficult to isolate the positron component. Since there is 
no way of experimentally separating the lifetimes of free 
positrons in liquids and positrons in a bound state (Me+), 
the problem is solved using the method of angular correla- 
tion of annihilation radiation (see Sec. 3.3). So, paper [126] 
is the first kinetic study of the reaction of a positron with 
Cl- in water: 


et + Cl- > (etCl-) — 2y. (30) 


The positronium component was suppressed by the ion- 
inhibitor NO,. It was found that keci- = (1.50.5) X 
1011]. mol-!-s-!. Separating the spectral components 
of angular correlation of annihilation radiation can conven- 
iently be done using the computer programs PAACDIF 
and PAACFIT [127] and some of the applications of the sta- 
tistical regularization method [128]. 

In some angular experiments it was possible to work out 
the kinetic constants even without inhibiting positronium, if 
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in special experiments the forms of the narrow component of 
the angular distribution (the component being due to sin- 
glet positronium Jy) and of the component of the pick-off 
annihilation of positronium J, had been determined. The 
forms of positron components can be obtained from angular 
curves by simply subtracting the positronium components 
with weights found from measurements of lifetime distribu- 
tions of annihilation [129]. In such a way the rate constant 
for interaction of positronium with CCl, in benzene was 
found: 
e* + CCl, — (e*CCl,) > 2y, 
Kacen = (8.32.7) x 10 1-mol!-s-!. 


The marked difference in the shapes of angular distribu- 
tion for free annihilation of positrons J; and annihilation 
from bound state Ip is indicative of the reaction of binding 
a positron with a CCl, molecule and seems to militate the 
supposition, made by Hatcher [130], that positronium for- 
mation by halogen derivatives is inhibited through the break- 
ing by positrons of the weak bond C—Hal with subsequent 
moderation down to the energies smaller than the lower 
bound of the Ore gap, since in that case the shapes of J; and 
I, must coincide. 

In [131] experiments involving reactions of positrons 
with halogen ions were carried on. At the same time some 
theoretical calculations of annihilation characteristics of 
positronium [132], the Met system, and also the PsM sys- 
tem, treated as etM-, were performed. Their binding ener- 
gies were estimated. The papers by Schrader and Wang 
[133], Madia and Ache [134], Cade and Farazdel [435], 
and Byakov et al. [136] are worth noting. A comparison of 
predictions [134] and [133] with measurements [47] indi- 
cates that the CNDO-2 (complete neglecting of differential 
overlapping) method, suggested by Pople and co-workers 
[437], yields much better results if it uses additional empir- 
ical information about atoms and diamagnetic molecules, 
which are positron acceptors. 

Cade and Farazdel [135] specially computed the angular 
distribution of positron annihilation in bound states with 
halogen ions (etHal-). Good agreement with experiment is 
always observed [126, 131], which is a good test of the approx- 
imation of the multiconfiguration Hartree-Fock method 


(31) 
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used in [135]. On the other hand, this agreement comes a 
bit as a surprise because the curves have been obtained in 
a solution, whereas the calculation has been done for the iso- 
lated system etHal-. It is known, by the way, that the wave 
function of a positron extends far (about 4 Ä) beyond the 
confines of an ion. There is no satisfactory explanation of 
this so far. The results obtained suggest, however, that there 
exists a Specilie (repulsive) interaction of the system 


(etHal- = PsHal) with the surrounding molecules. 

In conclusion, there exists a question of fundamental im- 
portance whose solution may benefit from studies of the de- 
cay characteristics of positrons and mesons. It is concerned 
with the optical activity of compounds that form the founda- 
tion of life on Earth. They, with rare exceptions, consists of 
D-sugars (RNA and DNA) and L-amino acids. It may well 
be asked: Why must such compounds be optically active at 
all, and is there any special mechanism that would account 
for the fact that preference was given to L-amino acids in 
the initially racemic mixture when life made its appearan- 
ce? The answers to these questions can come, in particular, 
from experiments on interaction of ionizing radiations 
(positrons, »*-mesons) with optically active molecules, and 
from observations, say, of asymmetric (with a distorted 
singlet-triplet ratio) positronium (muonium) in these 
systems [138, 139]. 


4. Mesoatoms 


We are going here to give only a brief outline of the chemical 
applications of the results concerned with transformations 
of ut-, u`- and sı-mesons, the relevant general considera- 
tions and experimental arrangements. For more details the 
reader is referred to books and reviews [4, 10-12, 141, 147]. 
In recent years extensive information about the current me- 
son studies has also been available in the periodically pub- 
lished proceedings [149, 150] of the so-called uSR-confer- 
ences. 


4.1 Muonium Chemistry 
Muonium (Mu), along with positronium, belongs to the new 


atoms, the lightest isotopes of hydrogen. Some of the prop- 
erties of muonium have already been discussed early in 
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this review. Whereas with positronium the fate of this exot- 
ic atom can be traced by examining the characteristics of 
annihilation radiation, with muonium and wt+-meson one 
has to rely on other properties. The lifetime of u*-meson is 
2.2 x 10-6 s. Unlike positronium annihilation, the life- 
timeisindependentof the state of meson. But of importance 
for u+-mesons is the orientation of the spin of a meson, which 
points in the opposite direction to the motion of the meson 
after the generation when 1+-mesons decay in flight. Further, 
according to the theory of the two-component neutrino 
(Landau, Lee and Yang, 1957) for the oriented spin of 
ut-meson the probability of ejection of a positron (see (1)) in 
the direction of the meson’s motion is one half that of its 
ejection in the opposite direction, i.e. the maximum asym- 
metry coefficient C, defined as the ratio of the difference of 
decay numbers backwards and forwards to their sum, is 
1/3. When orientation is lost (depolarization), the asymmet- 
ry coefficient goes down to zero (for totally depolarized me 
sons), i.e. the probability of positron ejection becomes spa- 
tially isotropic. 

One other important property of wt+-meson and muonium 
is that in a transverse magnetic field H their spins rotate 
with the Larmor frequency œw = eH/2mc, where e isthe elec- 
tron charge, and for p+ m = m, is the particle’s mass. 
For muonium the precession occurs with the electron fre- 
quency (m = m,). As a result, in weak magnetic fields the 
muonium frequency of precession is 103 times higher than 
the meson one. The two particles appear thus to be easily 
distinguishable. It is on these facts that the techniques 
of investigating into meson transformations are 
based. 

An installation includes a target (sample), in which polar- 
ized mesons are moderated, and a system for counting po- 
sitrons (that are produced in decays of u+-mesons and trav- 
el in a given direction) depending on the time they spend 
in the sample. Just as in positron experiments, this time is 
measured using a time-to-amplitude converter; p-mesons 
are obtained in decays of m-mesons in a beam produced by a 
high-power accelerator. The precession of the spin of a 
ut-meson about the direction across the magnetic field shows 
up as oscillations of the counting rate of positrons in a given 
direction (Fig, 14) [4]. These oscillations can be described by 
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the following equation: 
N (t) = N {B + exp (—t/t,) L1 -+ A, cos (yt + $y) 
+ A, exp (t/T2) cos (wat + P2)]}; (32) 


where N, is the normalization factor; B is the background 
of random coincidences; t is time, t, is the lifetime of 
ut-meson (2.2 x 10-8 s); A,, œ; and y, are the amplitude 
(asymmetry), frequency and phase of the meson Larmor 
precession (13.55 kHz/G), respectively; and A,, @ and @, 
are the respective parameters for the muonium precession 
(1.394 MHz/G). Damping of oscillations at muonium fre- 
quency as a result of the interaction of this atom with the 
environment is characterized by the relaxation time T,. 

If one expresses the speed of disappearance of the muonium 
signal (Tz!) as a function of the concentration of the active 
additive [Ac] introduced into the solution, the rate constant 
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Number of channels 


Fig. 14 Precession curve for 1*-meson in CHBr, [4]: the abscissa axis 
is the time ¢ (in the number of channels in the analyzer), the ordinate 
axis is the number of w*-meson decays at t in a given direction 
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of interaction of muonium with the additive can be given by 
1 1 1 

Klar. (38) 

where T$ is the lifetime of muonium in a pure solvent. 

Muonium atoms form when pt-mesons retard in matter; 
during this process an electron is captured and lost several 
times. Because charge-exchange cross-sections attain gas- 
kinetic values, it is generally believed that for substances 
for which the ionization potential is lower than that of muo- 
nium (13.6 eV) pt-mesons will slow down to thermal ener- 
gies and appear as muonium atoms. But it isonly recently 
that the sophistication of meson experiments has reached the 
level enabling accurately to measure the muonium yield in 
liquids [154]. Up until that time the applicability of the 
Ore scheme could only be tested for inert gases He and 
Ne, for which the probabilities of muonium formation differ 
markedly from unity. On the basis of new experimental evi- 
dence attempts are at present made to evaluate the possible 
role of spur processes in muonium formation [152, 153]. 
As with positronium (see Sec. 3.2), the spur mechanism is 
compared with the model of muonium formation by ‘hot’ 
mesons. But, as noted in [154], both models are still fairly 
crude to account for all the body of experimental data, e.g. 
for the difference in the action of MnOz, NO3 and Cd?* 
on the behavior of muonium in water [155]. 

In the USSR extensive research on muonium chemistry 
was done by V. Firsov and L. Ponomarev with their co-work- 
ers, and we will now give some examples of their results. 

In experiment the various types of muonium interaction 
manifest themselves in variety of ways in the depolarization 
spectrum. If the result of a muonium reaction isa diamagnet- 
ic product of the type MuH or MuCl, the Larmor frequency 
coincides with that characteristic of n+-meson. An example 
of this reaction is the interaction of muonium with satu- 
rated aliphatic hydrocarbons: 


K 
Mu--R—CH, — MuH+R—CH;. (34) 
Another form of chemical interaction is the addition to a 
multiple bond with the result that muonium becomes a 
neighbor of an unpaired electron: 
K 
Mu+R=CH, — RMu— CH}. (85) 
120 
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Fig. 15 Variation of polari- 
zation at the meson frequen- 
cy with the number of double 
bonds k in a six-membered 
hydrocarbon ring [132] 





Under these conditions the muonium spin depolarizes quick- 
ly, and in a magnetic field no contribution will be made to 
the precession at the meson frequency. 

Other two channels of muonium reactions are spin ex- 
change (conversion) and oxidation (electron transfer). 

By way of example we can take the interaction of muoni- 
um with alkylbenzene [147] with a side chain of ever increas- 
ing length (Table 4). It was found that K, increases while 


Table 4 Rate Constants for Interaction of Alkylbenzenes 
with Muonium, 1-mol-1-s-! [147] 








Compound (Ki+K,)x10-° K,/Kı | Kıx10 
CoH 3.440.3 10.5+4.2 0.30+0.11 
CHCH; 3.30.4 3.31.0 0.71740. 20 
C.H; CoH, 3.60.8 2.4+0.8 1.05+0.30 
CsH;C,H, 4.34+0.9 1.4+0.3 1.80.6 

eH CeHis 5. 744.0 0.7840. 22 3.240.7 
CH,CH(G;H,)C5H, | 4.340.9 1.40.3 1.80.6 


K, is immune to the influence of the side chain. The result 
obtained suggests that muonium interacts not with the entire 
molecule but with some of its chemically active fragments. 
If the chain is branched, the interaction rate is conditioned 
by the maximum linear dimensions of the chain rather than 
by the total number of hydrocarbon bonds in it. 


State-of-the-Art in ‘New Atom’ Rescarch 181 


For muonium interactions with hydrocarbons containing 
a six-membered cycle with ever increasing number of double 
bonds (from cyclohexane to benzene), the rate of addition of 
muonium to a double bond increases monotonically (the po- 
larization P at the meson frequency decreases) (Fig. 15). 
The only exception from this sequence is 1,4-cyclohexadiene 
with an unconjugated pair of double bonds. 

Exchange interaction with paramagnetic additives in a 
solution (e.g. Fe+) reduces the time T, of the spin relaxa- 
tion of a u+-meson. 

Besides chemical applications the method of measuring 
the times of spin relaxation of mesons is suitable for inves- 
tigations into the magnetic properties of some solids. For 
example, the results obtained for a-Fe,0, both above and 
below the Morin temperature show that the local magnetic 
fields affecting mesons undergo a two-fold change in the 
process [156]. 

Consequently, as in the case of positronium, the meson 
characteristics appear to be sensitive to the physico-chemical 
properties of the medium, and so they can be used to study 
these properties. The technique of computing the rate con- 
stants for muonium reactions is due to Firsov [157, 158]. 
Of special interest is a comparison of muonium reactions 
with atomic hydrogen reactions. This might provide an indi- 
cation of the reactivity of atomic hydrogen, and unlike pos- 
itronium the analogy here is still more close, since the quan- 
tum-mechanical effects, which complicate the picture, are 
virtually absent here. 


4.2 Negative Meson Chemistry 


Earlier in the review we have already noted that new atoms 
can be ‘obtained’ from the hydrogen atom not only by re- 
placing the proton by a lighter positive particle (et, ut), 
but also by replacing the electron by a heavier negative 
particle (u-- or n~- meson). Before mesohydrogen forms, mes- 
ons are moderated in matter by ionization of the environ- 
ment. Approaching the hydrogen nucleus a meson screens 
its charge with the result that the electrons leave the atom. 
Mesons are initially captured at a high energy level with the 


quantum number NV, ~ Vu*, where p* is the reduced mass 
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of the mesohydrogen atom in terms of the electron rest mass. 
For u” and n”, N, is 14 and 15, respectively. The transition 
of mesohydrogen atoms into the ground state occurs through 
radiative cascade transitions. Owing to special selection 
rules the average time per entire cascade is 10° s for pn“. 
The observed times are much shorter, however. This is ex- 
plained by the transfer of excitation energy to the electrons 
of the neighboring atoms. 

When a mesohydrogen atom collides with the nuclei of 
other atoms a transfer of a meson may occur: 


pu-+Z—> p+ Zw, 
PW +Z => p+ Zw. 


If mesons attain the ground state of the meson atom, they 
are either captured by the nucleus or spontaneously disinte- 
grate. When captured a u--meson interacts with the proton 
of the nucleus so that there emerge a neutron and an anti- 
neutrino. The capture of a m--meson by the nucleus with 
Z œ í results in the decay of the nucleus. In the case of the 
mesohydrogen atom the m~-meson interacts with the proton 
through the so-called charge-exchange reaction, which yields 
a neutron and n°-meson. The rate of this reaction is many or- 
ders of magnitude higher than that of the decay of m--meson. 

It was initially believed that the capture of mesons into 
mesoatom orbits must be independent of the nature of the 
chemical bonds involved, since the mesoatom orbits lie 
within K-shells of atoms. There is unequivocal evidence 
from experiments, however, that testifies that the chemical 
structure influences these processes. 

Depolarization of negative muons. Interesting experimen- 
tal and theoretical studies of the depolarization of negative 
muons in various systems were performed in the Laboratory 
of Nuclear Problems of the Joint Institute for Nuclear Re- 
search by V. Evseev, M. Frontas’eva, E. Tammet and others 
1159-162]. 

Among other things they have studied interactions of 
u-mesoatoms of carbon, oxygen (mesoboron and mesonitro- 
gen) in solutions with molecules of nitroxyl radical, tria- 
cetonamine and methyleyclohexane [160, 161]. They also 
found the absolute values of rate constants for corresponding 
reactions. These results are quite interesting because they 


(36) 
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enable one to obtain the rate constants for reactions of con- 
ventional atoms (nitrogen and boron). These studies would 
be of special interest for mesohalogens (e.g. mesofluorine 
in neon). Methodologically, these experiments are, however, 
too complicated. But in this case experiments were concer- 
ned with the difference in the rates of interaction of mesoat- 
oms with nitroxyl radicals in water and benzene. Experi- 
ments with triacetonamine and methylcyclohexane were of 
secondary importance and they established that the differ- 
ence in changes of residual polarization with increasing rad- 
ical concentration in water and benzene was due to the dif- 
ference in reactivities of mesoboron and mesonitrogen in re- 


lation to the reaction centre SN-0'. 


The observed ratios of rates of all reactions of mesoatom 
for a solvent and nitroxyl radical made it possible to get 
interesting information concerning the concentration of the 
radicals in the vicinity of the mesoatom (H', 0°‘, OH’) 
at the beginning of the reaction. In this way information is 
obtained about track processes. 

Another important result was the examination of the con- 
centration dependence of the residual polarization of u-mes- 
ons in the system C,H, -+ CCl, [161]. It suggests that mes- 
oboron enters into a chemical reaction with chlorine. The 
yield of radicals typical of CCl, in the radiolysis of the solu- 
tion is sensitive to the rate of excitation energy transfer 
from benzene to CCl,. Experiments on the influence of a 
magnetic field normal to the spin of the muon on the resi- 
dual polarization in CCl, have shown that the rate of the 
radical formation process, which is controlled by the inter- 
molecular energy transfer, is lower as compared with the ra- 
te at which mesoboron reacts in the solution. It is important 
in the context of checking on the mechanisms of energy trans- 
fer and radical formation in the mixture C,H, + CCl,. 

The issue of energy migration in complex molecular sys- 
tems is also taken care of by the research of the depolariza- 
tion of negative muons depending on the length of the hydro- 
carbon chain of aliphatic alcohols, paraffins, chlorine alkyls 
and benzene derivatives [161]. In the case of substituted hy- 
drocarbons comparison of the residual polarization with the 
Hammet constants and the intensity of radical formation 
in the vicinity of a substituent shows that evidence from re- 
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sidual polarization studies really characterizes the relative 
power of energy transfer along the molecular chain. 

u-SR-studies. Chemically anomalous compounds of the 
type uC (mesoboron) also are unique in that they enable 
the nature of chemical bonds that include u-C to be studied 
through precision measurements of precession frequencies of 
the w--meson spin in a transverse magnetic field. So, this 
method was used [163] to study highly oriented pyrolytic 
graphite and intercalated rubidium compounds in this sys- 
tem. It appears that 1°C in the graphite two-dimensional 
lattice is bound through SP?-hybride orbitals with two neigh- 
boring carbon atoms, as in three-coordinated organic boron 
compounds. Therefore, the paramagnetic shielding of the 
nucleus at the u C site must be anisotropic. In this case, 
the anisotropy of precession frequencies can yield informa- 
tion about the details of valence bonds, about their changes, 
for example as a result of intercalation. Let us take a closer 
look at it. The precession frequency w is expressed as the 
deviation ô from the standard frequency w, found from the 
g-factor of the w--meson associated with C (13.5 kHz/G): 
ô = (o, — o)/o. 

In graphite the major contributor to the precession fre- 
quency displacement is the paramagnetic component 
oP); according to the method of magnetic orbitals its differ- 
ence 6, — 6) = ore off) (the subscripts || and 1 
signifying the orientation of an external magnetic field paral- 
lel and perpendicular to the C-axes, respectively) turns out 
to be caused by a ‘disbalance’ P in the electron density on 
2P-electron orbits. According to [163]: 


ol?) | = — (e?h?/m2c2A) (1/73), PE ". 


Here (1/r?), = 4.11 x 10% cm? is the 2p atomic wave func- 
tion averaged over the radial part; A is the average energy 
of electron excitation. Now, PŁ is mostly sensitive to the 


electron density of P,-orbitals, whereas p! is determined 
exclusively by the nature of o-orbitals. 

Measurements of 6, and ô, and calculations using the 
above relationship at A = 5.1 eV suggest thatthe contribu- 
tion of P,-orbitals to S?P-hybride bondsis P,, = 0.1, which 
is characteristic of boron-organic compounds, 
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And the change of ô with rubidium intercalation suggests 
that the relative contribution of P,- and P,-orbitals differs 
from the ideal S2P-hybridization, i.e. o-bonds in the graphite 
lattice are distorted. This circumstance perhaps explains 
the hitherto-unclear phenomenon of staging [164]. 

This all illustrates the kind of specific and interesting in- 
formation that can be obtained from mesoatom chemism. 

Nuclear capture of m~-mesons by protons. We hav2 mentio- 
ned earlier the charge-exchange reaction that occurs when 
a proton captures a m~-meson: 


n+ p—>n+ n. (37) 


——>2y 


It is used to study the state of hydrogen in various chemical 
compounds. The probability of this reaction can be meas- 
ured with arelative error of less than 10-4. So high an accura- 
cy stems from the fact that a destruction of n°-meson gives 
rise to two annihilation quanta. These quanta are emitted 
in opposite directions and can be easily singled out from a 
background using the coincidence method. 

Let us now look at what determines the probability of re- 
action (37) on the molecule Z,,H,, which has m atoms with 
the nuclear charge Z and n hydrogen atoms. Experiments we- 
re performed to compare the probability of the reaction for 
various objects, including chemical compounds and mixtu- 
res of equivalent composition (e.g. for the mixture 2H, + 
N, and hydrazine N,H, [165]). Their results showed that 
for the mixture the probability is much higher. In our exam- 
ple W (N, + 2H,)/W (N,H,) = 30. This suggests an influ- 
ence on the meson capture of the molecule’s valence shell 
as a whole and can be explained in terms of the idea of 
‘large meson molecules’ ZyııH,„ [166]. 

A molecule is termed large because some of the mesons 
can be captured at high energy orbits, which are common for 
the entire molecule. These orbits lie near molecular orbitals 
and they are filled when the valence electrons are substituted 
(their total number is 2n). Another share of mesons is cap- 
tured by isolated orbits of the Z atoms (Zn). The px~ orbits 
cannot be occupied directly since the only electron of the 
hydrogen atom is employed to form a chemical bond, 
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Then the probability for mesons to be captured at orbits 
common for the entire Z,,H, molecule will be 


W, = 2n/{mZ + n). (38) 


In the next stage of the process mesons are captured by the 
isolated orbits pr” and Zn”, the probability W, of the cap- 
ture by the px- orbits is relatively small and declines quick- 
ly with nuclear charge Z: W, œ Z-?. 

In principle, only the proportion W, of the mesons cap- 
tured by the isolated prn- levels will be involved in reaction 
(37), since the meson transfer reaction (36) can also occur, 
although with a lower probability (W, œ 1). 

The total probability of reaction (37) for Z,,H, thus has 
the form 


W = W,W.W, = anZ-?/(mZ +- n), (39) 


which is in good agreement with experiment. The coeffi- 
cient @ here takes care of the nature of the hydrogen bond and 
correlates with the ionic nature of the bond. As ionicity 
increases œ falls off. 

Found from experiment, this coefficient can be used to 
characterize the nature of bonds in molecules. By way of 
illustration we can take the correlation between a and the 
dissociation constants of acids pK, [147, 167] (Table 5). 


Table 5 Correlation Between Probability of n--Meson Capture 
by Protons and Acidity [147] 





Wx10*® | a | 


Acid pKı 
M,SO, <0.4 <0.06 =3.0 
HNO, <0.6 <0.12 1.4 
H,C,0, <0.1 <0.1 1.3 
H,SeO, 0.9+0.5 0.17+0.08 2.6 
H,PO, 1.6-£0.4 0.17+£0.04 24 

„BO, 4,241.0 0.29-£0.05 8.7 
Al(OH); 5.61.0 0.48£0.09 9.2 


The stronger an acid, i.e. the stronger the ionic nature of 
the OH bond, the lower the coefficient a, which reflects 
the electron density near an H-atom. 

A distinct correlation was also found between the Ham- 
mett constants and @ of the methyl group in CH,X systems, 
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where X stands for —H, —(CH,),5n, —NH,,—C,H,, and 
a number of other groups. Figure 16 is a plot of a similar 
linear dependence of a@y- 9 on the protolytic constants 
pK, of organic acids [147]. It should be emphasized that 
such correlations can be successfully used to work out phy- 
sico-chemical parameters when for some reason or other con- 
ventional methods are no good, because the probability of 
the capture of a x--meson is independent of the physical 
conditions. 

Lastly, the method under consideration provides good pos- 
sibilities for investigation into hydrogen bonds. 

u~-Meson X-ray spectroscopy. Formation of w--mesoatoms 
can be traced from the characteristic X-ray emission in ra- 
diative transitions from excited states. 

Because a p.~-meson interacts with the nucleus relatively 
weakly, this phenomenon is less distorted by the background 
from the nuclear capture than in the case of m~-mesons. 
This method is for all practical purposes analogous to the 
well-known method of electron X-ray spectroscopy. The 
emission is detected by a scintillation (Nal(Tl)) counter or 
a Ge(Li)-semiconductor detector. 

It might appear that the parameters of the radiation must 
be independent of the chemical structure of the target. In 
reality, however, the intensity and wavelengths of the radia- 
tion are very sensitive to the physical and chemical proper- 
ties of the environment. If a mixture or compound are rep- 
resented by the formula Z,Z,,, then, assuming that the Fep- 
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mi-Teller Z-law works [168] the ratio of capture probabili- 
ties will be 
n IZ) k Z 
A (ZIZ) = — Ta" Gr (40) 
where J is the intensity of the radiation. 

This law works, however, only for mixtures of elements or 
ions. In other cases the dependence will be more complica- 
ted. For a mixture of oxides, for example, the ratio A (Z/8) 
displays an interesting periodic dependence on Z [169]. 
Minima of curves in this series coincide with the beginning 
of periods in the periodic system. This again suggests that 
the atomic capture of p~-mesons is also sensitive to the state 
of the electron shell. 

This influence is apparent not only in the total intensity 
of the meson series of X-rays but also in the relative contri- 
bution of their individual components. 

The influence of the chemical structure on the intensity 
and composition of the X-ray radiation can, just as with the 
nuclear capture of x--mesons, be explained on the assumption 
of ‘large mesomolecules’ (see Sec. 4.3). 

Although the observed effects still remained quantitati- 
vely unexplained, it is quite obvious that the spectroscopy 
of meson X-ray radiation is a promising technique for quali- 
tative and quantitative chemical analysis. It is of particu- 
lar interest that the high penetrating power of X-ray radia- 
tion makes the method suitable for bulk samples. 


5. Conclusion 


We have got a glimpse of a quite interesting area of research, 
which came into being at the junction of several discipli- 
nes: elementary-particle physics, solid-state physics, and 
chemical kinetics. Referring to a number of examples we 
have seen how the employment of modern facilities of nu- 
clear physics makes it possible, on the one hand, to identify 
specific formations—‘new’ atoms (positronium, muonium, 
ı-mesoatoms), to study their transformations; on the other 
hand, to obtain valuable information about the structure 
and chemical properties of substances from the influence 
of the medium on the transformations of the new atoms, 
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Analysis of intrinsic characteristics of the new atoms and 
bound states of positrons and comparison of these character- 
istics with predictions are also of great interest as a check 
on several model concepts and calculating techniques [170]. 

It has not been the aim of this review to go into details 
of experimental arrangements and theoretical interpreta- 
tions. The bibliography contains references to special re- 
views and original works to which the reader is referred for 
more profound treatments of the issues covered. It is seen, 
however, from the review that studies of the physics and 
chemistry of the new atoms at this stage not only made it 
possible to formulate the principal underlying concepts, 
but also resulted in a quite promising method of probing 
into the physico-chemical properties of matter. And much 
of the information is either inaccessible with other methods 
or is obtained much simpler. 


Beyond doubt, further development of experimental tech- 


niques and the joint efforts of physicists and chemists 
will expand these possibilities. 
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Quantum Electronics 


V. S. Letokhov 


The advent of quantum electronics was an inevitable and 
logical stage of the growth of applications of the laws of 
quantum physics, which is with good reason considered one 
of the major triumphs of the natural sciences of the 20th 
century. 


l. From Quantum Physics 
to Quantum Electronics 


In 1900 Planck introduced into physics the concept of the 
quantum, an indivisible lump of energy that can be ab- 
sorbed or emitted in the process of radiation. In 1905 Einstein 
introduced the concept of light quantum, or photon, as a 
real particle of electromagnetic field. Then Bohr considered 
the process of photon emission and came to the following 
fundamental conclusion. The atom is characterized by a set 
of certain energy levels, i.e. values of its total energy Æ, 
E,, E,, and so on. To each such level there corresponds a 
stationary state of the atom in which it does not emit; emis- 
sion only occurs in a stepwise, or quantum, transition of the 
atom from one state to another, the radiation frequency œ 
being determined by the difference of the energies of the ini- 
tial and final energy levels: œ = (E, — E,)/k, where h 
is Planck’s constant 

The next important step was made by Einstein in 1916 
in his famous work Emission and Absorption of Radiation Ac- 
cording to Quantum Theory [1], in which he introduced two 
types of quantum transitions in the atom or molecule be- 
tween discrete quantum states. The first type is the spontane- 
ous transition of an excited atom into a state with a lower 
energy accompanied by the emission of a light quantum. 
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Einstein wrote [1]: ‘This transition occurs without external 
influences. It can hardly be imagined that it is similar to 
anything other than radioactive decay.’ The second type is 
the induced transition between energy states. It is caused by 
a radiation to which the atom is exposed and its probability 
is proportional to the radiation intensity. Owing to an in- 
duced transition an excited atom gives off an additional light 
quantum and an unexcited atom, conversely, absorbs such 
a quantum. 

Einstein introduced the notions of spontaneous and in- 
duced transitions using his amazing intuition. The only proof 
was that using these notions he easily derived Planck’s 
formula for the spectrum of thermal radiation. An impecca- 
bly objective researcher, he noted [4]: ‘Of course, I am wil- 
ling to recognize that the three hypotheses concerning spon- 
taneous and induced radiation by no means become convinc- 
ing facts because they lead to Planck’s radiation formula. 
However, the simplicity of the hypotheses, the generality 
and easiness of the treatment ... enable me to consider it to 
be quite likely that this treatment will become the core of 
the future theoretical concepts.’ 

Einstein appeared to be absolutely right. In 1925-26 the 
efforts of remarkable international constellation of physi- 
cists resulted in the birth of quantum mechanics, which pro- 
vided the necessary theoretical basis for a strict description 
of the interaction of light with atoms and molecules. Before 
long, in 1927, the outstanding British physicist P. Dirac, 
one of the fathers of quantum mechanics, constructed his 
quantum theory of radiation [2] drawing on the main equa- 
tion of quantum mechanics—the Schrédinger equation—and 
Einstein’s concept of light quanta. Dirac rigorously proved 
the validity of Einstein’s hypotheses concerning spontaneous 
and induced radiations and predicted the main properties 
of these phenomena. He showed that a photon whose emis- 
sion is induced by another photon has with the latter abso- 
lutely identical characteristics—the direction of travel, ener- 
gy or frequency, polarization. It is on this identity of pho- 
tons of induced radiation that all quantum electronics rests, 
notably lasers—sources of coherent light. And the word 
‘coherence’ itself means that clementary quantum emitters— 
atoms and molecules—emit identical quanta, i.e. they emit 
in sympathy, in synchronism. 
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But the phenomenon of induced radiation as such does 
not enable one to control the radiation process, nor to ob- 
tain powerful light radiation. Really, Einstein postulated 
that there exist two types of induced transitions, those 
with emission and with absorption of photons. Under nor- 
mal equilibrium conditions the number of atoms on any ex- 
cited energy level is always smaller than on any lower level. 
Therefore, the number of induced transitions with photon 
absorption in matter always exceeds the number of such 
transitions with photon emission. This is manifested in an 
obvious manner—all the objects around us are absorbers of 
radiation. 

As early as the 1930s it was felt that the situation can be 
changed in a fundamental way if a nonequilibrium distribu- 
tion of atoms over excited states was somehow attained, so 
that in one of the excited states there would be more atoms 
than in a lower one (see the chronology of developments in 
[3]). This nonequilibrium distribution of atoms over energy 
levels is now termed the population inversion of levels. 
Among those who pioneered such ideas in the 1930s was the 
Soviet scientist V. Fabrikant. He even went so far as to suggest 
some techniques of achieving population inversion (see the 
bibliography in [3]). With population inversion of two levels 
induced transitions with photon emission dominate over 
induced transitions with photon absorption, and so in this 
nonequilibrium medium an incident flux of photons must 
be amplified due to the birth of identical photons. 

The idea of light amplification by induced transitions in a 
medium with population inversion paved the way for pro- 
duction of fluxes of identical photons, but subject to one 
condition: the input to the amplifying medium must be a 
flux of identical photons as well. Otherwise, the output flux 
will reproduce all the imperfections of the input. But all 
the existing light sources were producing chaotic radiations 
utterly unsuitable for isolating a suitable flux of identical 
photons to be further amplified. Another fundamental de- 
velopment was placing the amplifying medium into a res- 
onator with a loss factor of photons lower than their ampli- 
fication factor, or gain. Then amplification can be started 
by one or several spontaneous photons, which repeatedly pas- 
sing through the amplifying medium will produce a sea of 
their replicas. This chain process of sustained multiplica- 
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tion ofidentical photons is generally called generation, and the 
device an optical quantum oscillator, or laser. But the idea of 
quantum oscillator of coherent light came into optics from 
radio. In essence the discovery of the laser was a happy 
marriage of the key ideas of optics and radio. 


2. Why Maser First, not Laser? 


After Hertz had confirmed experimentally the existence of 
electromagnetic waves that were predicted by Maxwell and 
have the same nature as light waves, physicists started to 
conquer the radio range. Radio engineering went the path 
of build-up of the power of radio oscillators, the improve- 
ment of the sensitivity of radio receivers and the shortening 
of wavelengths of radio waves employed. The needs of 
radar led to the use of centimetre radio waves. Microwave 
oscillators appeared to be useful tools for systematic studies 
of absorption spectra of molecules in the radio range, to give 
rise to radio spectroscopy [4]. This perhaps was the first 
synthesis of quantum-mechanical concepts, which accounted 
for the microwave spectra of molecules splendidly, and ex- 
perimental radio engineering. 

Radiospectroscopic research was analogous to the study 
of the optical spectra of atoms and molecules, but now exper- 
imentalists had at their disposal radiation sources with a 
much higher spectral brightness than in optics. Since spon- 
taneous transitions in the radio range are highly unlikely, 
the interaction of radio waves with molecules was dominat- 
ed by induced transitions. Along with direct (absorbing) 
induced transitions were observed reverse (radiative) tran- 
sitions of excited molecules predicted by Einstein. This was 
a second fruitful synthesis of quantum-mechanical predic- 
tions with the experimental potentialities of radio engineer- 
ing. Experiments with molecular beams allowed inhomoge- 
neous electric fields to be used to sort out molecules in vari- 
ous quantum states. Drawing on these results the Soviet 
physicists N. Basov and A. Prokhorov simultaneously with 
the U. S. scientist C. Townes suggested and realized in 1954- 
55 the idea of the molecular oscillator (maser) [5-7]. This 
fundamentally new type of electromagnetic oscillator has 
the following principle: at first ‘active’ (excited) molecules 
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Fig. 1 The sequence of discoveries in optics and radio that have led 
to the birth of quantum electronics 


are prepared, which then give off energy in a cavity resona- 
tor used to sustain electromagnetic oscillations. 

In microwave quantum electronics were developed low-noise 
quantum amplifiers on paramagnetic crystals [8, 9] and 
the highly frequency-stable atomic hydrogen quantum 
oscillator [10]. But the most spectacular breakthroughs of 
quantum electronics lay ahead—the extension of the maser 
principle to include the optical range [11]. Instead of the 
cavity resonator A. Prokhorov suggested in 1958 to use in 
the short-wave range the open Fabry-Perot resonator [12]. 
Soon, in 1960-61, the first optical masers, or lasers, came 
upon the scene [13, 14]. 

Figure 1 shows the evolution of key concepts, ideas, and 
experiments, which eventually gave birth to quantum 
electronics. It is clearly seen how a possibility was not rea- 
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lized of discovering the laser by way of extending the con- 
cepts of light amplification in a medium with population in- 
version. 

The principle of the laser is much simpler than that of 
the maser, and it now seems astounding that science had 
to make such a zigzag on its way to a coherent light source. 
This can be explained in different ways. Early lasers used 
ideal crystals or high-reflection mirrors, which were unavail- 
able back in the 1920s and 1930s. But this is hardly of 
principle, since now we can improvise a laser, e.g. a gas- 
discharge pulsed CO,-laser in the infrared or an N,-laser in 
the ultraviolet, from materials available in a physics labo- 
ratory even back in the 1920s. It is quite likely that experi- 
mentalists chanced upon laser-like phenomena but ignored 
them. If in an experiment of that type stimulated emission 
had been observed, it could perhaps be explained, since the 
theoretical basis for that was already there. This discovery 
might be a matter of chance, but it would be of no conse- 
quence for the birth of quantum electronics. 

Another historical factor is also worth mentioning. In 
the 1930s and 1940s most active researchers in physics were 
concerned with nuclear physics and engineering. This in- 
evitably caused some ‘brain drain’ from other physical dis- 
ciplines, notably from atomic and molecular physics and 
optical physics. Some role was also played by the boom in 
nuclear physics and radar during World War II. This gave 
more weight to the development of radio as compared with 
optics. This, obviously, also goes to explain the zigzag in 
Fig. 4. 

Nevertheless, the birth of quantum electronics is a good 
example of the development of a new science along a logical 
path, the one predetermined by the course of the evolution 
of science and technology as a whole. The inextricable links 
between the various branches of the ‘tree of science’ make 
such a discovery inevitable, even if chance does not come to 
help. So with quantum electronics, the first to come was theo- 
ry, in this case the theory of induced emission [1, 2]. Then 
experimental techniques were produced to observe the 
predicted phenomenon, in this case microwave oscillators, 
which made possible the radiospectroscopy of molecules 
and eventually the observation of stimulated transitions. 
Further, the concept of positive feedback, so common in 
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radio, was fused in a natural manner with the phenomenon 
of induced emission of excited molecules. As a result, the 
maser was created [6-7]. In the wake came the laser, but this 
was a logical development in quantum electronics, a branch 
of physics whose emergence was marked by the Nobel phys- 
ics prize in 1964 [15]. 

To conclude this brief sketch of the ‘genesis’ of quantum 
electronics, we will provide another proof that it would 
have been inevitably discovered in the late 1950s or early 
1960s owing to the intimate connection between various 
branches of science and technology. Within 10 years after 
the laboratory maser had been produced, a natural cosmic 
maser was discovered that radiated at the quantum transi- 
tion of the OH radical (A = 18.5 cm) in galactic nebulae 
[16]. The exceedingly high temperature of the radio emis- 
sion of hydroxyl molecules (T,, S 10!° K) was immediately 
accounted for using the maser principle—the amplification 
in the medium of molecules with population inversion of the 
transition energy levels corresponding to the frequency of 
the spectral line being amplified. Suppose the laboratory 
maser had not been constructed by that time. Then it would 
have been inevitably discovered owing to radioastronomical 
observations. 


3. Lasers—the Mainstay of Progress 
in Quantum Electronics 


The advent of early optical quantum oscillators late in the 
1950s and early in the 1960s marked a revolution in the 
young science of quantum electronics. Using Einstein’s pro- 
cess of induced, or stimulated, emission it tackles the prob- 
lems of production, control and use of the light energy. Lasers 
are, in essence, converters of energy of relatively low quality 
(thermal, chemical, electrical—depending on the way in 
which population inversion in a medium is obtained) 
into the energy of coherent radiation (a flux of photons iden- 
tical in all respects), which can be viewed as a higher-quali- 
ty form of energy. 

The high power of laser radiation, the high directivity 
and monochromaticity, the possibility to form short pulses 
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of coherent light and to adjust the frequency of radiation 
make coherent light a highly flexible tool suitable for a wide 
variety of applications. Quantum electronics made it pos- 
sible for the first time at will to transform and concentrate 
in space, time and spectrum the energy of light to the same, 
or even higher, degree than it has been possible, for example, 
with electromagnetic energy in lower-frequency ranges— 
from d.c. to microwaves. The present period of mastering 
the electromagnetic oscillations in the optical range seems 
to follow logically the century-long period of the discoveries 
and development in electricity, and it is now even diffi- 
cult to predict all the implications of these developments. 

During the last 20 years a multitude of various lasers have 
been produced for an infinite variety of purposes, which 
differ markedly in their characteristics [17, 18]. Above all, 
lasers have different modes of operation. There are lasers of 
continuous, pulsed and pulsed-periodic operation. Pulsed 
lasers are designed to produce pulses with high peak powers 
(up to 1013 W on unique facilities), short duration (down to 
fractional picoseconds), and high energy (up to tens of kilo- 
joules on unique facilities). Continuous-wave lasers are de- 
signed to produce continuous radiation of power ranging wide- 
ly (from fractional milliwatts to hundreds of kilowatts for 
different laser media). They have high frequency stability 
(up to relative instabilities of 10-#). 

To be sure, it is never possible to have all the extremes of 
characteristics in one and the same laser, and, as a rule, 
each specific laser is constructed to obtain coherent light 
either with several satisfactory parameters or with one ex- 
treme parameter, the other parameters being moderate. The 
latter situation is typical of lasers designed for a concrete 
application. For example, high-temperature heating ofa 
plasma requires high-power short pulses of high energy and 
adequate spatial coherence to be able to focus highly in- 
tense radiation on a small area, the spectral width and wave- 
length of the radiation, even within a wide range, are im- 
material. Another example: thermal processing of materi- 
als (welding, cutting, hardening) requires continuous-wave 
lasers with high power and lenient requirements to radiation 
spectrum. 

Active (amplifying) media in lasers may be various sub- 
stances in various states of aggregation [17, 18]: crystals 
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radio, was fused in a natural manner with the phenomenon 
of induced emission of excited molecules. As a result, the 
maser was created [6-7]. In the wake came the laser, but this 
was a logical development in quantum electronics, a branch 
of physics whose emergence was marked by the Nobel phys- 
ics prize in 1964 [15]. 

To conclude this brief sketch of the ‘genesis’ of quantum 
electronics, we will provide another proof that it would 
have been inevitably discovered in the late 1950s or early 
1960s owing to the intimate connection between various 
branches of science and technology. Within 10 years after 
the laboratory maser had been produced, a natural cosmic 
maser was discovered that radiated at the quantum transi- 
tion of the OH radical (A = 18.5 cm) in galactic nebulae 
[16]. The exceedingly high temperature of the radio emis- 
sion of hydroxyl molecules (T,, S 10!° K) was immediately 
accounted for using the maser principle—the amplification 
in the medium of molecules with population inversion of the 
transition energy levels corresponding to the frequency of 
the spectral line being amplified. Suppose the laboratory 
maser had not been constructed by that time. Then it would 
have been inevitably discovered owing to radioastronomical 
observations. 


3. Lasers—the Mainstay of Progress 
in Quantum Electronics 


The advent of early optical quantum oscillators late in the 
1950s and early in the 1960s marked a revolution in the 
young science of quantum electronics. Using Einstein’s pro- 
cess of induced, or stimulated, emission it tackles the prob- 
lems of production, control and use of the light energy. Lasers 
are, in essence, converters of energy of relatively low quality 
(thermal, chemical, electrical—depending on the way in 
which population inversion in a medium is obtained) 
into the energy of coherent radiation (a flux of photons iden- 
tical in all respects), which can be viewed as a higher-quali- 
ty form of energy. 

The high power of laser radiation, the high directivity 
and monochromaticity, the possibility to form short pulses 
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of coherent light and to adjust the frequency of radiation 
make coherent light a highly flexible tool suitable for a wide 
variety of applications. Quantum electronics made it pos- 
sible for the first time at will to transform and concentrate 
in space, time and spectrum the energy of light to the same, 
or even higher, degree than it has been possible, for example, 
with electromagnetic energy in lower-frequency ranges— 
from d.c. to microwaves. The present period of mastering 
the electromagnetic oscillations in the optical range seems 
to follow logically the century-long period of the discoveries 
and development in electricity, and it is now even diffi- 
cult to predict all the implications of these developments. 

During the last 20 years a multitude of various lasers have 
been produced for an infinite variety of purposes, which 
differ markedly in their characteristics [17, 18]. Above all, 
lasers have different modes of operation. There are lasers of 
continuous, pulsed and pulsed-periodic operation. Pulsed 
lasers are designed to produce pulses with high peak powers 
(up to 103 W on unique facilities), short duration (down to 
fractional picoseconds), and high energy (up to tens of kilo- 
joules on unique facilities). Continuous-wave lasers are de- 
signed to produce continuous radiation of power ranging wide- 
ly (from fractional milliwatts to hundreds of kilowatts for 
different laser media). They have high frequency stability 
(up to relative instabilities of 10-‘*). 

To be sure, it is never possible to have all the extremes of 
characteristics in one and the same laser, and, as a rule, 
each specific laser is constructed to obtain coherent light 
either with several satisfactory parameters or with one ex- 
treme parameter, the other parameters being moderate. The 
latter situation is typical of lasers designed for a concrete 
application. For example, high-temperature heating ofa 
plasma requires high-power short pulses of high energy and 
adequate spatial coherence to be able to focus highly in- 
tense radiation on a small area, the spectral width and wave- 
length of the radiation, even within a wide range, are im- 
material. Another example: thermal processing of materi- 
als (welding, cutting, hardening) requires continuous-wave 
lasers with high power and lenient requirements to radiation 
spectrum. 

Active (amplifying) media in lasers may be various sub- 
stances in various states of aggregation [17, 18]: crystals 
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and glasses with ion additives, semiconductors, molecular 
solutions, atomic and molecular gases, gas-discharge plas- 
ma, and so on. For instauce, a glass with some neodymium 
ions is used in unique laser installations to obtain high- 
power short pulses. Neodymium glass, just like ruby crystals, 
is pumped optically by powerful flash lamps. Population in- 
version in semiconductors is normally obtained using an 
electric current; and in atomic and molecular gases, using 
pulsed or continuous electrical discharge. In some cases, 
especially to produce intense laser beams, the use is made of 
chemical reactions (chemical lasers) and thermal excitation 
(gas-dynamic lasers). 

Lasers also differ widely by quantum transitions employed 
[17, 18]. Lasers operate on electron transitions of atoms 
(He-Ne-laser), molecules (N,-laser, excimer lasers, dye la- 
sers), ions (argon laser), semiconductors (GaAs-laser, etc.), 
vibrational (CO,-laser, etc.) and rotational (HF-laser, etc.) 
transitions of molecules. We may now be safe in saying that 
virtually any optically homogeneous medium can, using 
a suitable pumping technique, be brought, if only for a short 
time, to a nonequilibrium state with population inversion 
for some pair of energy levels. 

The best way to trace the development of the quantum 
electronics of the optical range is by watching the improving 
of the characteristics of laser radiation: the growth of 
power under pulsed and continuous-wave modes, the shorten- 
ing of laser pulses, the improvement of spatial coherence. 
As each of these characteristics improved, many, often un- 
expected, applications of coherent light in various fields of 
science and technology were emerging. The high power of 
laser radiation lies at the heart of the many applications 
(from laser chemistry to laser thermonuclear fusion) that 
use strong excitations and/or heating of matter with light. 
The ultrashort pulses enabled metrologists dealing with 
measurement of ultrashort time intervals to go over from 
nanoseconds to subpicoseconds. The high stability and mo- 
nochromaticity of quantum oscillators are the basis of the 
quantum metrology of time and length. 

The most spectacular is the growth of power of laser ra- 
diation. The first laser in the world [13] used a ruby crystal 
and produced light pulses of power about 1 kWand duration 
about 10-3 s. Modern multistage and multichannel neody- 
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Fig. 2 Growth of the output power of (a) pulsed and (b) continuous-wave 
lasers over the years 


mium laser facilities are able to generate more than 10% J 
during 1 ns, i.e. to produce pulses of power 101? W = 10 TW. 
Thus, during the last 20 years the power of coherent light 
pulses has increased 101° times. 

This spectacular progress was only possible due to tho 
invention and construction of various types of lasers. La- 
sering process control, which consists in quickly giving off 
the energy accumulated in an active medium in the form of 
an extremely short pulse (~40 ns) (Q-switched mode [19]). 
allowed the pulse power to be increased up to 10° W. Using 
the multistage amplifier [20] increased the energy up to about 
10-100 J and the power up to 10°-40!! W. Lastly, the use 
of laser disc amplifiers of large diameter (up to 30 cm) and 
parallel stages of amplifiers with subsequent summation of 
energy at the target made it possible to generate pulses 
with energy 1-10 kJ and power 1012-4013 W [21]. 

Figure 2a shows the chronology of pulse powers achieved 
with the advances in laser technology. The progress was so- 
lely due to improvements in methods of control of the laser- 
ing process (shortening the pulse duration) and pulse ampli- 
fication in multistage and multichannel devices (increasing 
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the pulse energy). As to the energy accumulated and given 
off in the form of coherent light by a unit volume of active 
medium, achievements were modest, and so high-power la- 
sers of necessity use large volumes of active medium (thou- 
sands and more litres). 

The principle of the first continuous-wave laser [14] 
was the glow-discharge in the He-Ne mixture, its power 
being under a milliwatt: And now there exist 100-kW con- 
tinuous-wave CO,-lasers [22]. During the first 10 years the 
power of continuous-wave lasers increased 10'° times. This 
was achieved owing to the discovery of new active laser 
media, new pumping methods and new modes of operation. 
Switching from glow-discharge in the low-pressure He-Ne 
laser to high-current discharge in argon allowed one to obtain 
about 10 W (in recent years more than 100 W) in visible 
light using the transitions of the argon ion [23]. The next 
stage was the discovery of lasering at vibrational transitions 
of the CO, molecule in the discharge of the CO, : N,: He 
mixture [24], i.e. the discovery of lasering at low vibrational 
molecular levels. This increased the power up to hundreds 
of watts. Lastly, the solution of the problem of heat removal 
from the pumping region by continuous fast blowing of a 
gaseous mixture in combination with the use of high-power 
pumping (electric discharge, heating and gas-dynamic ex- 
pansion) made it possible to increase the power up to 
100 kW, and this does not seem to be the limit [22]. Figure 
2b shows the chronology of the growth of the output power 
of continuous-wave lasers of various types. Unlike high- 
power monopulsed lasers the progress here was nearly complete- 
ly due to the discovery of new types of laser media and 
methods of exciting them. 


4. Coherent Light and Progress 
in Optics and Spectroscopy 


"Today, after a quarter of a century of the development of 
optical quantum electronics, we can look at the evolution 
of optics from another angle. 

. In optics, a branch of physics that studies the nature of 
light and the phenomena occurring in interaction of light 
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with matter, in the 20th century two distinct stages stand 
out. The first stage, up to the 1950s, was concerned with es- 
tablishing the quantum nature of light and its interaction 
with matter, the construction of quantum mechanics using 
the evidence of atomic and molecular spectra, and lastly 
the interpretation of optical properties of matter on the ba- 
sis of quantum theory. The second stage, from the 1950s on- 
wards, has consisted in using the laws of quantum theory to 
obtain coherent light and to control its interaction with mat- 
ter, in fusing the ideas and methods of coherent optics and 
radiophysics. During this period, at the junction of optics 
and radiophysics, quantum electronics and related 
branches of optics—nonlinear optics and holography —made 
their appearance. 

The coming of laser sources of coherent light prompted 
the development of coherent optics. Of especial consequnce 
in this field is the discovery of holography, which enables all 
information about a light field to be recorded and repro- 
duced. This information includes the amplitude, phase and 
polarization of a light wave, not only its intensity (as is 
the case in photography). In the late 1940s D. Gabor [25] 
published his idea of recording a wave field in one plane, 
the idea was then refined by the Soviet scientist Yu. Deni- 
syuk, who in 1962 obtained three-dimensional holograms 
[26]. Later holograms were obtained such that when ex- 
posed to incoherent white light the viewer sees a three-dimen- 
sional image of the object. These days holography has blos- 
somed into an independent growing branch of optics with 
many applications. 

With the advent of high-power lasers there emerged non- 
linear optics, a branch based on the dependence of optical 
properties of medium on the intensity of light. Note that 
the term ‘nonlinear optics’ was coined by S. Vavilov long 
before the coming of lasers. As light interacts in a nonlinear 
manner with matter its frequency undergoes transformations, 
the light beam undergoes self-focusing, the medium under- 
goes self-bleaching, and so forth. Now the techniques of 
nonlinear optics [27, 28] have made their way into all do- 
mains of traditional optics and allow researchers to handle 
many problems, which previously seemed to be unsolvable. 
By way of example we can mention the wave front conjuga- 
tion. 


14e 


212 V. S. Letokhov 


The reversibility of light rays has long been known in 
optics, and it follows from the equations of electrodynamics. 
But the unique potentialities of the reversibility could not 
be tapped for want of devices to carry out this operation with 
involved spatially inhomogeneous light fields in real time. 
The operation was successfully carried out early in the 1970s 
using two nonlinear-optical methods: the four-wave mixing 
of light waves in a nonlinear medium [29] and induced scat- 
tering of light by hypersound [30]. This expanded signifi- 
cantly the scope of physical optics. For example, it became 
possible to focus with real lenses high-power laser beams of 
large aperture onto a spot with diffraction dimensions, to 
sum up in a coherent fashion laser beams that have come to 
the target by independent paths, and so on. 

Laser radiation radically changed the scene in spectro- 
scopy, having provided the limiting spectral resolution, the 
limiting sensitivity and temporal resolution. Specifically, 
on the basis of nonlinear interactions of laser radiation with 
atoms and molecules extremely narrow optical resonances with 
a relative width of down to 10-11 have been obtained. These 
resonance techniques formed the foundation of nonlinear su- 
perhigh-resolution laser spectroscopy free of the Doppler 
broadening [31]. Using selective multistep photoionization of 
atoms and molecules the limiting sensitivity at the level 
of single atoms [32] and molecules [33] has been attained. 

The second stage of the evolution of the optics of the 20th 
century is still far from its completion. Alongside further 
vigorous growth of laser technology, which would make 
laser an accessible and useful tool for any potential user, 
new ground will have to be covered in quantum electronics, 
coherent and nonlinear optics. In particular, quantum elec- 
tronics will have to intrude into vacuum ultraviolet and 
soft X-ray radiations. 


5. Picosecond Quantum Electronics 


Despite the enormous potential of electronics, before the 
laser the metrology of short time intervals was progressing 
at a relatively slow pace. When the Kerr cell controlled by 
an electric pulse made its appearance nanosecond resolutions 
became reality. Since 1965, five years after the laser had been 


Quantum Electronics 243 


produced, extensive research efforts got under way to de- 
velop measuring techniques for subnanosecond intervals using 
lasers producing ultrashort pulses. It took only ten years to 
work out techniques for times as short as 10-!3 s, which were 
based on the fundamental principles of quantum electronics 
and nonlinear optics. Picosecond quantum electronics is 
based on induced quantum transitions with rate proportional 
to the light intensity. The rate may be many orders of mag- 
nitude higher than the rate of spontaneous radiative tran- 
sitions. 

In fact, in the simplest case of the resonance of radiation 
with the transition 1 — 2 during a time interval longer than 
the time within which phase memory is lost in an ensemble 
of quantum systems 7, (for molecules in a solution at room 
temperature T, œ 10-'!2s) the induced transition rate Wina 
between levels 1 and 2 is given by 


Wing = Orel /horz2, (1) 


where o,, is the cross-section of the radiative transition. 
For a typical molecule in a solution with a wide band of elec- 
tron absorption o,, œ 10-16 cm?. Under exposure to light 
with intensity J = 10° W/cm’, Wma œ 10% s-i, i.e. the 
molecule can be induced to emit and absorb a photon with 
probability close to 100% during a picosecond. Therefore, 
using sufficiently intense laser light we can control its absorp- 
tion or amplification in a corresponding (absorbing or ampli- 
fying) medium with picosecond time scales. It is this unique 
feature of induced transitions that lies at the heart of pico- 
second lasers. 

Going to ultrashort durations t of lasering process leads 
to the broadening Av of the laser spectrum, since Av-t œ 1. 
For example, the spectrum of a laser pulse with duration 
Tt = 1 ps has the width Av = 30 cm! = 10!" Hz. Therefore, 
a laser with such a wide frequency interval has very many 
longitudinal modes. The average frequency interval between 
longitudinal modes is 6v = v/2L (L is the length of the res- 
onator, v is the average group velocity of light in the res- 
onator) and for a typical fluorescent crystal or glass laser 
ôv ~ 0.01 cm. 

Consequently, a picosecond laser is a device in which 
3 x 10% longitudinal modes are generated in phase. 

In a multimode laser the phases of light oscillations in 
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each mode are generally independent of one another, and 
hence are absolutely random. The radiation of such a laser 
is just like light noise. It is possible, however, to observe mu- 
tual induced or spontaneous phase-locking of light oscillations 
in various modes. Induced mode-locking is achieved by perio- 
dic modulation of, say, losses or by amplification at a frequen- 
cy Q coincident with the intermode frequency interval ôv [34]. 
Spontaneous mode-locking can occur if inside the resonator 
some resonance-absorbing medium is introduced [35]. Such a 
medium bleaches with a rate Wina in an intense laser field and 
returns quickly to the original absorbing state due to the 
nonradiative relaxation of electron excitation. 

A laser with a quickly bleaching medium in the reso- 
nator is a nonlinear unstable self-oscillatory system in which 
phases of the light fields in various modes are spontaneously 
locked (self-mode-locking), or, in other words, ultrashort light 
pulses are generated spontaneously. This laser is a fascinat- 
ing example of a nonlinear nonequilibrium system in which 
order spontaneously emerges out of disorder; in other words, 
modes generate synergetically [36]. 

The natural limit for shortening the pulse length is the 
period of a light oscillation T = 1/v œ 10-% s=1 fs. 
To reach this limiting duration is a quite complex task, 
since a femtosecond pulse is bound to broaden even within 
a range of several wavelengths due to the dispersion of the 
optical medium. Nevertheless, the shortest laser pulse todate 
has a duration of t = 30 fs [37], a value of the same order 
of magnitude as the period of molecular oscillations. During 
such a short time atoms and molecules with the typical ther- 
mal velocity vy» = 3 x 104 cm/s shift by only 0.1 A, i.e. 
by a distance much shorter than their dimensions. Therefore, 
an atom or molecule registered by a subpicosecond laser pulse 
will appear to be stationary for an observer. 


6. Quantum Electronics and Quantum 
Metrology 


Quantum electronics made a revolution not only in the mea- 
surement of ultrashort time intervals but also in the pre- 
cision measurement of long intervals of time (atomic time 
scale) and length, ` 
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In the 1940s and 1950s were worked out techniques of form- 
ing narrow spectral lines using atomic and molecular beams. 
For example, two spaced microwave fields [38] interacting 
with a beam of Cs atoms at the transition between the levels 
of the hyperfine structure of the ground state were used to 
obtain resonances with a width of Av = 50 Hz at v, = 
9.3 GHz. This corresponds to the relative width Av/v, = 
5 x 10°? or the Q-factor (resolution power) of the reso- 
nance R = v,/Av = 2 x 108. A yet narrower spectral line 
(Fig. 3) is possessed by the hydrogen maser [10] operating at 
1.420 GHz (the famous 21-cm line). The narrow radiofre- 
quency resonances were the underlying principle of most 
important devices, quantum frequency standards, and the 
atomic time scale now adopted throughout the world. 

Exceedingly narrow resonances at much higher frequencies 
(nuclear transitions) were discovered by Mössbauer [39]. 
For example, for the 93-keV y-transition in ®Zn a resonance 
with Q-factor 2 x 101° can be obtained. The narrow reso- 
nances of nuclear transitions in crystalline lattice now yield 
the highest sensitivity in measurements of the detuning of 
transition energy and quantum energy. 

Nonlinear interaction of highly monochromatic laser ra- 
diation coherent in time and space with atoms and molecules 
can also yield narrow spectral resonances in optical range 
with a width many orders of magnitude smaller than the Dop- 
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pler width [31]. For instance, the absorption line at vibration- 
al-rotational transitions in the CH, molecule at about A = 
3.39 um has a Doppler broadening of 3 x 108 Hz. On the 
other hand, in a He-Ne-laser with a CH,-absorbing cell 
within the resonator were obtained spectral resonances with 
a width of only 10° Hz, i.e. with Q-factor 10° [40, 41]. With 
the help of this narrow nonlinear resonance the frequency of 
the He-Ne-laser was stabilized at the centre of the absorp- 
tion line of the methane molecule and a stability and repro- 
dueibility of frequency of about 10-18% was achieved [40]. 
This laser has an emission line only several hertzes wide, 
i.e. the laser is the most monochromatic radiation source in 
the entire enormous range of electromagnetic oscillations 
mastered by modern technology. 

Unlike microwave oscillators, a laser with highly stable 
radiation frequency in the optical range can at the same time 
be a standard of length and time. The shortness of light 
wavelength as compared with characteristic dimensions of de- 
vices allows precise interference measurements of length to 
be made. That is why the current international length stan- 
dard is the wavelength of the optical spectral line of Kr. 
True, unlike the radio range, up until fairly recently no fre- 
quency, i.e. time, measurements were possible in the optic- 
al range. Therefore, length and time are measured using two 
different spectral lines—the optical emission line of Kr 


at 6056.9 A for length measurements and the microwave ab- 
sorption line of 133Cs at 3.27 cm for time measurements in 
the international time standard, the cesium atomic standard. 
The laser frequency standard in the optical range is a realiza- 
tion of Brillouin’s suggestion [42] to create a unified quan- 
tum standard of length and time. The most realistic form of 
this unified standard is the one using a laser as a source of 
electromagnetic oscillations with a highly stable and re- 
producible emission wavelength A and frequency v [43] 
related by c = Aw (c is the velocity of light). 

In metrology the laser turned out to be an exceptionally 
useful tool. For example, metrologists now use the laser length 
standard, based on frequency stabilization of a He-Ne-laser 
at 6328 A by narrow resonance absorptions of the iodine 
molecule. Its precision is better than 10-1", i.e. two orders 
of magnitude better than the available krypton length 
standard, Stable-frequency lasers were used to measure the 
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velocity of light to within 10° [44], which is also two orders 
of magnitude better than earlier measurements. On the hasis 
of these achievements in 1983 was introduced a new defi- 
nition of the metre in terms of the velocity of light ‘as the 
distance covered by light in a vacuum during the time in- 
terval 1/299,799,458 seconds’. Recently comparisons have 
been performed of the frequencies of lasers using molecular 
datums in the optical range, and of the frequency of the ce- 
sium atomic standard to within 10-11. It is expected that 
the accuracy of the comparison will be improved up to 10-1, 
which will make it possible to check the constancy in time 
of the fundamental physics constants, which exert different 
influences on atomic and molecular frequencies [45]. 


7. Quantum Electronics 
and Some Future Problems 


Quantum electronics and its applications are growing fast 
these days. This progress can perhaps be compared with the 
growth of radioelectronics and its applications in the 1920s 
and 1930s. Coherent light finds it way into virtually all 
spheres of human endeavor. Lasers are of especial interest 
in those domains of science and technology where they can be 
of help in solving the global problems of mankind. This 
question deserves special analysis, which lies beyond the 
scope of this article. But it would also be wrong to pass 
over this question in a review of the strides of quantum elec- 
tronics, one of the most spectacular achievements of the 
physics of the 20th century. Therefore, we will confine our- 
selves to several important issues, the uses of quantum elec- 
tronics} in’ communications and informatics, power engi- 
neering, chemistry, and biology. 

Communications and informatics. The inventions and wide 
spread of means of communications, storage and processing 
of information are some of the most distinguishing features of 
the scientific and technological revolution of the 20th cen- 
tury. Lasers and coherent light are able significantly to im- 
prove the capabilities of communications and informatics 
in the near decades. 

Immediately after the laser had been invented engincers 
noted the most important feature of the optical range in 
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terms of information transmission—the exceptionally high 
information-carrying capability of the light beam (about 
4 x 105 GHz), an equivalent of the capacity of 10° video 
channels. Technologically, this unique property of the laser 
beam could be realized after two inventions made their ap- 
pearance. First, the miniature semiconductor laser was con- 
structed (about the size of a grain). It used a heterostructure, 
i.e. a multilayer p-n junction, and therefore could operate 
at room temperature [46]. The radiation intensity of this 
laser diode can be readily controlled within about 10° Hz 
by modulating the bias current. Second, glass fibres made 
their appearance, which could transmit light with negligible 
losses (less than 1 dB/km) [47]. For example, information 
carried by laser radiation can be communicated over a sin- 
gle-mode fibre light guide with a bandwidth-distance product 
of about 100 GHz-km. Spectral multiplexing of channels in 
fibre-optical communication lines will enable the limiting 
channel capacity to be attained by relatively simple means. 
This will make information unit transfer per unit length 
less expensive by far in terms of funds and energy. Using 
this technique society might afford communication of per- 
sonal information at the user’s will. This could perhaps 
replace today’s transmission of the same information to the 
mass user (radio and television broadcasting, cinema, etc.) 
and could influence the way of life of the postindustrial so- 
ciety [48]. 

Despite the enormous strides made in information storage 
and processing, mostly due to the advances in microelectron- 
ics, there are still some fields in which the laser could be of 
much help [49]. First, it appears quite feasible to create an 
optical holographic memory with a capacity up to 10%° bits 
per 1-m? area. Spectral multiplexing in photochemical in- 
formation recording would make it possible to overcome the 
recording density barrier of 4 bit per A? (A is the light wave- 
length) and to achieve three-dimensional information stor- 
age in”a plane with a density of no less than 10° bits per A? 
[50]. The optical technique of information recording and read- 
ing is distinguished for the easy access to memory and is 
therefore suitable*for storage off both operative and archive 
information. 

Lastly, the principles of coherent optics and holography 
offer promise in respect of mathematical processing of images, 
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i.e. they allow logical operations in parallel over many chan- 
nels. This in turn opens up great possibilities for creating 
in the future of optical discrete computers with speeds much 
higher than those of electronic computers. 

Power. There are three directions apparent to-day in which 
lasers could make their contribution to the power industry 
of the future: laser separation of isotopes, laser-controlled 
thermonuclear fusion and transport of light energy over large 
distances. 

Laser separation of isotopes uses the selective action of 
light on atoms or molecules of a certain isotope composition. 
The most efficient and universal here are the isotopically 
selective photoionization of atoms and isotopically selective 
photodissociation of molecules in absorption of several 
photons. These methods were suggested and realized experi- 
mentally for the first time in the Spectroscopy Institute of 
the USSR Academy of Sciences in the years 1970-74 (see 
[51]) and then they attracted attention in the USA and other 
countries. 

Of the greatest importance for power industry is the ap- 
plication of laser isotope separation to enrich uranium in 
23U to be used as fuel in nuclear reactors. The high efficiency 
and low energy requirements of the process can make the 
nuclear fuel much cheaper. The potentialities of lasering 
processes are best illustrated by noting that the gas-diffusion 
method requires 5 MeV, and centrifuging about 0.3 MeV 
to separate each ?U atom. On the other hand, the lasering 
process, which uses selective ionization or dissociation, re- 
quires less than 10 eV of light energy per atom, or 0.1-10 keV 
of electric energy for a typical efficiency of lasers from 10 
to 0.1%. 

High selectivity, i.e. high enrichment factor, inherent in 
most of laser separation methods will result in a drastic 
curtailment of the residual 2®U in the tailings, which will 
cut the need for new ore by 40%. High selectivity is very 
important for the enrichment of rare isotopes, e.g. for the 
extraction of deuterium from jwater. Deuterium in quantity 
is required for heavy-water nuclear reactors using natural 
uranium (not enriched). 

Finally, laser techniques can be employed for extracting 
the radioactive isotopes in used fuel from nuclear reactors. 
First, this is of importance for reducing the radioactivity 
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of reactors (e.g. to remove tritium from heavy-water reac- 
tors); second, this is of interest for the production of the va- 
rious cheap radioactive isotopes. 

The production and heating of high-temperature plasma 
by laser radiation is a new direction, which emerged at the 
junction of quantum electronics and plasma physics. That it 
is possible to heat up matter to thermonuclear temperatures 
using laser radiation focused at a target was shown theoretic- 
ally [52] as early as in 1962, soon after the early high- 
power pulsed lasers came on the scene. These predictions ini- 
tiated investigations into laser controlled thermonuclear 
fusion (LCTF) now under way in many countries. The modern 
concept of LCTF [53] is based on irradiating from all direc- 
tions of a small thermonuclear spherical target (0.1-1 mm 
in dia) by a high-power (10'3-101* W) laser pulse of nano- 
second duration. Exposed to that high-power radiation the 
spherical target must contract strongly (up to densities 
102-108 g/cm?) and heat up to more than 10° K. This must 
initiate a fusion reaction with a release of energy in the 
deuterium-tritium mixture. 

To achieve a breakeven situation, when the energy re- 
leased in the fusion will be comparable with the laser energy 
input into the target the neutron yield must be 1018-101 per 
pulse. It has been estimated that this can be achieved at 
laser pulse energies up to 10° J. A viable laser thermonuclear 
reactor will require reliable lasers with a pulse energy of no 
less than 10° J and a repetition rate of several hertzes. Con- 
struction of such lasers is a challenging problem in terms 
of physics and engineering, whose solution is one of the main 
areas of the growth of quantum electronics. It can be expect- 
ed that this problem will be solved by the end of the 20th 
century. It is quite probable that by that time the economics 
of a laser thermonuclear power station will become more 
clear, which in the final analysis determines the expediency 
of LCTF in power production as compared with alternative 
methods. 

Space research, notably long-term space labs, is another 
of the prospective employments for quantum electronics, 
which is again concerned with power production. For one 
thing, in space we have a constant and free source of inco- 
herent light energy, the solar radiation. In a near-Earth 
orbit a 300-m collector produces about 100 MW of solar ener- 
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gy. And the costs of this energy will be determined by the 
costs of fashioning and maintaining in orbit the light col- 
lector. For the other thing, large space stations need high- 
power energy sources. This problem is now solved using solar 
batteries, which directly transform the light from the Sun 
into electricity. Quantum electronics prompts another way 
of using the solar energy and transmitting it over large dis- 
tances on scales much larger than the current requirements 
of space labs. This path consists, first, in constructing or- 
bital lasers pumped either directly by solar light or by a 
nuclear reactor in orbit; second, in transmitting the solar 
energy in the form of highly collimated coherent light beams 
to both space objects that need this energy and to the Moon, 
Earth, and so on; third, in direct transforming the energy 
of laser radiation into a form suitable for the majority of 
applications. Now we seem to be in the early stage of the 
development of all the key components required for the fu- 
ture laser-nuclear-solar power production. 

In the first stage of the development of space-based laser- 
solar systems the mass of power supply units on satellites 
can obviously be reduced at first by switching to feeding them 
by laser radiation from orbital lasers. In the next stage light 
energy will be transmitted to major stations on the Moon 
and, in the end, to the Earth to increase its energy potential. 
This sequence of steps seems to be quite logical in the chain 
of scientific and technological achievements and current 
needs of humanity. It seems quite probable that in the future 
it would pay to have large laser power stations much closer 
to the Sun than the Earth. For instance, in Mercury’s orbit 
the density of the solar energy is already 10 kW/m?, and so 
the energy removed would be an order of magnitude higher 
and the potential amplifier media would be more numerous. 
To be sure, this project is as yet beyond the engineering 
capabilities of the 20th century. 

Chemistry and biology. Laser light opened up absolutely 
new possibilities for selective impacts on matter at the molec- 
ular level, which is of considerable interest for chemistry 
and biology. 

Excitation of atomic or molecular states by visible or ul- 
traviolet light from conventional sources is a routine practice 
in photochemistry. Laser radiation here came in handy for 
two reasons. First, the high spectral brightness of laser ra- 
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of reactors (e.g. to remove tritium from heavy-water reac- 
tors); second, this is of interest for the production of the va- 
rious cheap radioactive isotopes. 

The production and heating of high-temperature plasma 
by laser radiation is a new direction, which emerged at the 
junction of quantum electronics and plasma physics. That it 
is possible to heat up matter to thermonuclear temperatures 
using laser radiation focused at a target was shown theoretic- 
ally [52] as early as in 1962, soon after the early high- 
power pulsed lasers came on the scene. These predictions ini- 
tiated investigations into laser controlled thermonuclear 
fusion (LCTF) now under way in many countries. The modern 
concept of LCTF [53] is based on irradiating from all direc- 
tions of a small thermonuclear spherical target (0.1-1 mm 
in dia) by a high-power (1013-101 W) laser pulse of nano- 
second duration. Exposed to that high-power radiation the 
spherical target must contract strongly (up to densities 
102-108 g/cm?) and heat up to more than 10° K. This must 
initiate a fusion reaction with a release of energy in the 
deuterium-tritium mixture. 

To achieve a breakeven situation, when the energy re- 
leased in the fusion will be comparable with the laser energy 
input into the target the neutron yield must be 1016-101" per 
pulse. It has been estimated that this can be achieved at 
laser pulse energies up to 10° J. A viable laser thermonuclear 
reactor will require reliable lasers with a pulse energy of no 
less than 10° J and a repetition rate of several hertzes. Con- 
struction of such lasers is a challenging problem in terms 
of physics and engineering, whose solution is one of the main 
areas of the growth of quantum electronics. It can be expect- 
ed that this problem will be solved by the end of the 20th 
century. It is quite probable that by that time the economics 
of a laser thermonuclear power station will become more 
clear, which in the final analysis determines the expediency 
of LCTF in power production as compared with alternative 
methods. 

Space research, notably long-term space labs, is another 
of the prospective employments for quantum electronics, 
which is again concerned with power production. For one 
thing, in space we have a constant and free source of inco- 
herent light energy, the solar radiation. In a near-Earth 
orbit a 300-m collector produces about 100 MW of solar ener- 
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gy. And the costs of this energy will be determined by the 
costs of fashioning and maintaining in orbit the light col- 
lector. For the other thing, large space stations need high- 
power energy sources. This problem is now solved using solar 
batteries, which directly transform the light from the Sun 
into electricity. Quantum electronics prompts another way 
of using the solar energy and transmitting it over large dis- 
tances on scales much larger than the current requirements 
of space labs. This path consists, first, in constructing or- 
bital lasers pumped either directly by solar light or by a 
nuclear reactor in orbit; second, in transmitting the solar 
energy in the form of highly collimated coherent light beams 
to both space objects that need this energy and to the Moon, 
Earth, and so on; third, in direct transforming the energy 
of laser radiation into a form suitable for the majority of 
applications. Now we seem to be in the early stage of the 
development of all the key components required for the fu- 
ture laser-nuclear-solar power production. 

In the first stage of the development of space-based laser- 
solar systems the mass of power supply units on satellites 
can obviously be reduced at first by switching to feeding them 
by laser radiation from orbital lasers. In the next stage light 
energy will be transmitted to major stations on the Moon 
and, in the end, to the Earth to increase its energy potential. 
This sequence of steps seems to be quite logical in the chain 
of scientific and technological achievements and current 
needs of humanity. It seems quite probable that in the future 
it would pay to have large laser power stations much closer 
to the Sun than the Earth. For instance, in Mercury’s orbit 
the density of the solar energy is already 10 kW/m?, and so 
the energy removed would be an order of magnitude higher 
and the potential amplifier media would be more numerous. 
To be sure, this project is as yet beyond the engineering 
capabilities of the 20th century. 

Chemistry and biology. Laser light opened up absolutely 
new possibilities for selective impacts on matter at the molec- 
ular level, which is of considerable interest for chemistry 
and biology. 

Excitation of atomic or molecular states by visible or ul- 
traviolet light from conventional sources is a routine practice 
in photochemistry. Laser radiation here came in handy for 
two reasons. First, the high spectral brightness of laser ra- 
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diation is sufficient to excite any discrete quantum state 
of an atom or molecule without necessarily searching for a 
random coincidence of the intense line of the spontaneous 
radiation of a conventional source with the absorption line 
of the particle being excited. Second, the high spectral bright- 
ness of IR-lasers is sufficient to excite molecular oscilla- 
tions [54]. Moreover, the high intensity of laser radiation 
gave birth to new techniques of multiphoton nonlinear pho- 
tochemistry [51] unparalleled in conventional photochemis- 
try. These techniques are based on the absorption of several 
(from two to tens) laser photons by one particle (an atom or 
molecule). The process is possible owing to the laser reso- 
nance excitation of most of particles and subsequent absorp- 
tion of photons by the excited particles. Note that the mul- 
tiphoton excitation of atoms or molecules was impossible 
in principle with conventional light sources because of the 
low temperature of their radiation. 

Especially exciting is the promising field of multiphoton 
oscillatory IR-photochemistry. So far molecular vibrational 
excitation was only possible through heating under equilib- 
rium conditions, when all the degrees of freedom (transla- 
tional, vibrational, rotational) of all the particles in a mix- 
ture are excited by collisions. The multiphoton excitation of 
molecular vibrations by an intense IR-beam has the advan- 
tages that (1) excitation occurs purely radiationally without 
a contribution from collisions, and hence without exciting 
translational degrees of freedom; (2) it occurs with a high 
rate (up to 101? eV/s) due to the high rate of induced tran- 
sitions; (3) it is selective because of the resonance absorption 
of radiation by molecules of certain type in the mixture. 
These properties are all employed successfully in new laser 
techniques of isotope separation based on the multiphoton 
isotopically selective photodissociation of molecules by 
IR-radiation [55]. The methods of multiphoton IR-excita- 
tion, worked out mainly for isotope separation, are, however, 
extremely promising for chemistry. 

In a manner like high-power IR-radiation, intense visible 
or UV-radiations also cause multiphoton excitation of mole- 
cules, but via electron intermediate resonance states. For a 
molecule to absorb 10 eV in this case 2-3 photons would be 
sufficient. Excitation of high electron states of molecules 
initiates various photochemical reactions and processes that 
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yield products differing markedly from those formed in 
single-photon excitation of lower electron states. Note that 
it is in principle impossible to achieve single-photon excita- 
tion of high molecular states with an energy of 6-10 eV in 
solutions since the solution absorbs vacuum UV-radiation. 

Holding especial promise, it seems, is the use of high- 
power picosecond radiation, which can quickly excite high 
electron states in molecules and thus stimulate corresponding 
photoreactions with molecules of a certain type without in 
any way heating the medium. Here again we observe the 
above-mentioned important feature of stimulated transi- 
tions in an excited molecules—their high rate in a suffi- 
ciently intense laser field. Therefore, the rate of multiphoton 
excitation due to stimulated transitions according to (1) 
can lie in picosecond range, and hence it can compete suc- 
cessfully with the relaxation of excited molecules. Under 
these conditions molecules are excited without heating the 
medium. This is of especial importance for laser stimulation 
of biochemical reactions, which are very sensitive to heating. 
It has already been found that nucleic acids can be destroyed 
selectively in this way [51]. In my opinion, photochemical 
transformations of biomolecules stimulated by picosecond 
radiation open up a new field of photobiochemistry and evi- 
dently in the future they will make it possible to have laser 
pharmacology with biomolecules required being destroyed, 
modified or synthesized in vivo and in situ [56]. 

Finally, it is quite possible that selective excizing of some 
chromophores in biopolymer molecules exposed to picosecond 
laser beams will become the basis for direct observation of 
molecular structures with a resolution of about 1 A [54]. 
If this approach turns out to be a success, it will be possible 
directly to read the sequence of bases in a DNA, i.e. directly 
to read genetic information. As a result, gene engineering 
can obtain a fast analytical technique to select objects for 
genetic modifications and analysis of its results. 


8. Future Trends 


Radio oscillators of electromagnetic radiation use the period- 
ic motion of charged particles—electrons—to radiate ac- 
cording to the laws of classical physics. In this way, through 
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the use of various forms of electronic oscillators the radio 
range of the electromagnetic spectrum (up to about 10’ Hz) 
was mastered, which led to the emergence of radio, televi- 
sion, radar, and computers, 

Quantum oscillators of electromagnetic radiation use 
quantum transitions in particles (atoms, molecules, ions) 
caused by changes in their internal energy, not by their mo- 
tion. The operation of quantum oscillators (masers and la- 
sers) is governed by the laws of quantum physics. The last 
25 years have seen vigorous expansion of quantum electron- 
ics into the optical range. 

Scientists are now discussing ways of conquering shorter 
wavelengths—vacuum ultraviolet and X-rays—by using 
quantum transitions between energy levels of multiply 
charged ions in a plasma; and also the gamma-range, by 
using y-transitions between nuclear levels. In all of these 
cases the ideas of population inversion and the positive 
feedback are used for generation of stimulated emission. 
Practical realization of all the schemes of oscillators of X- 
and y-ray stimulated emission will be an enormous prob- 
lem [57, 58]. 

At the same time we now witness a new tendency in ven- 
turing into shorter wavelength ranges, namely, returning to 
the ideal radiating particle, the electron, but now in the 
megaelectronvolt range of its kinetic energy, where the elec- 
tron’s motion is governed by relativistic and quantum laws. 
Let us take several examples, 

In 1975 for the first time light amplification was per- 
formed by relativistic electrons undergoing forced vibrations 
in a spatially periodic structure. If the period of the varia- 
tion of the magnetostatic field A lies in the region 1-5 cm, 
then in the intrinsic coordinate system an electron ‘sees’ 
this structure as contracted by a factor of 2E,;/m,c*, where Eg 
is the electron energy, which may reach up to hundreds me- 
gaelectronvolts; mc? = 0.511 MeV is the rest energy of 
the electron. Therefore, the resonance interaction of the 
electron oscillating in a magnetic field with light occurs in 
the optical range at wavelengths 


a = Am,c?/2Ee. (2) 


This interaction results in an exchange of energy between 
electrons and the light field. If the light field is shifted in 
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phase by half-period relative to the oscillations of an elec- 
tron, then as a result of the exchange of energy the electrons 
will decelerate and the light wave will be amplified. In 1983 
for the first time was used [60] a visible-light free-electron 
laser, in which the 160-MeV electrons circulated in the stor- 
age ring of an accelerator. The success of this laser is due 
to the realization of the ‘optical klystron’ mode of operation, 
suggested by N. Vinokurov and A. Skrinsky in 1977. 

Prospects are good for developing free-electron lasers of 
sufficient power from hundreds of angstroms to hundreds 
of micrometres, with the average power from tens to hundreds 
of kilowatts and an efficiency of about 10%. 

Another interesting effect of electron radiation in the 
short-wave region was predicted by Kumakhov [61]. It re- 
lies on the forced oscillations of a relativistic electron trav- 
elling through a crystal in the ‘channelling’ mode. In that 
case, the period A is the period of the crystalline lattice 
a œ~ 10-8 cm; therefore, the spontaneous radiation of the 
channelled electron belongs to the X-ray range. For dense 
electron beams it is possible in principle to achieve a suf- 
ficient amplification for lasering in the X-ray range. 

Extending these ideas of stimulated emission by relativ- 
istic electrons in periodic structures, the idea springs to 
mind of making a laser using the Smith-Parsell radiation 
effect. The latter is the emission by an electron moving near 
the surface of a metal diffraction grating with a period of 
A œ hop œ 10-10” cm. The oscillating dipole that is 
produced by the electron’s ‘image’ radiates in the optical 
range at low velocities of the electron and in the X-ray re- 
gion at relativistic velocities. Again, at sufficiently high 
electron densities it is in principle possible to obtain ampli- 
fication and lasering in the X-ray range. 

We can thus perceive now some new trend in approaching 
the issue of short-wave lasers: uses of radiation produced by 
relativistic electron beams in various spatially periodic 
structures. Such an alternative approach will perhaps result 
in the exponential growth of the frequency of coherent radia- 
tion sources as shown in Fig. 4. Of course, this approach, 
just like that using the population inversion of multiply 
charged ions and nuclei, will require sufficiently high pow- 
ers. Production of high-power electron beams is however 
well established in experimental nuclear physics; therefore, 
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the use of various forms of electronic oscillators the radio 
range of the electromagnetic spectrum (up to about 10!? Hz) 
was mastered, which led to the emergence of radio, televi- 
sion, radar, and computers. 

Quantum oscillators of electromagnetic radiation use 
quantum transitions in particles (atoms, molecules, ions) 
caused by changes in their internal energy, not by their mo- 
tion. The operation of quantum oscillators (masers and la- 
sers) is governed by the laws of quantum physics. The last 
25 years have seen vigorous expansion of quantum electron- 
ics into the optical range. 

Scientists are now discussing ways of conquering shorter 
wavelengths—vacuum ultraviolet and X-rays—by using 
quantum transitions between energy levels of multiply 
charged ions in a plasma; and also the gamma-range, by 
using y-transitions between nuclear levels. In all of these 
cases the ideas of population inversion and the positive 
feedback are used for generation of stimulated emission. 
Practical realization of all the schemes of oscillators of X- 
and y-ray stimulated emission will be an enormous prob- 
lem [57, 58]. 

At the same time we now witness a new tendency in ven- 
turing into shorter wavelength ranges, namely, returning to 
the ideal radiating particle, the electron, but now in the 
megaelectronvolt range of its kinetic energy, where the elec- 
tron’s motion is governed by relativistic and quantum laws. 
Let us take several examples. 

In 1975 for the first time light amplification was per- 
formed by relativistic electrons undergoing forced vibrations 
in a spatially periodic structure. If the period of the varia- 
tion of the magnetostatic field A lies in the region 1-5 cm, 
then in the intrinsic coordinate system an electron ‘sees’ 
this structure as contracted by a factor of 2E ,,/m,c?, where Eer 
is the electron energy, which may reach up to hundreds me- 
gaelectronvolts; mc? = 0.511 MeV is the rest energy of 
the electron. Therefore, the resonance interaction of the 
electron oscillating in a magnetic field with light occurs in 
the optical range at wavelengths 
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This interaction results in an exchange of energy between 
electrons and the light field. If the light field is shifted in 
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phase by half-period relative to the oscillations of an elec- 
tron, then as a result of the exchange of energy the electrons 
will decelerate and the light wave will be amplified. In 1983 
for the first time was used [60] a visible-light free-electron 
laser, in which the 160-MeV electrons circulated in the stor- 
age ring of an accelerator. The success of this laser is due 
to the realization of the ‘optical klystron’ mode of operation, 
suggested by N. Vinokurov and A. Skrinsky in 1977. 

Prospects are good for developing free-electron lasers of 
sufficient power from hundreds of angstroms to hundreds 
of micrometres, with the average power from tens to hundreds 
of kilowatts and an efficiency of about 10%. 

Another interesting effect of electron radiation in the 
short-wave region was predicted by Kumakhov [64]. It re- 
lies on the forced oscillations of a relativistic electron trav- 
elling through a crystal in the ‘channelling’ mode. In that 
case, the period A is the period of the crystalline lattice 
a œ~ 10-8 cm; therefore, the spontaneous radiation of the 
channelled electron belongs to the X-ray range. For dense 
electron beams it is possible in principle to achieve a suf- 
ficient amplification for lasering in the X-ray range. 

Extending these ideas of stimulated emission by relativ- 
istic electrons in periodic structures, the idea springs to 
mind of making a laser using the Smith-Parsell radiation 
effect. The latter is the emission by an electron moving near 
the surface of a metal diffraction grating with a period of 
A œ dop œ 10-74-10- cm. The oscillating dipole that is 
produced by the electron’s ‘image’ radiates in the optical 
range at low velocities of the electron and in the X-ray re- 
gion at relativistic velocities. Again, at sufficiently high 
electron densities it is in principle possible to obtain ampli- 
fication and lasering in the X-ray range. 

We can thus perceive now some new trend in approaching 
the issue of short-wave lasers: uses of radiation produced by 
relativistic electron beams in various spatially periodic 
structures. Such an alternative approach will perhaps result 
in the exponential growth of the frequency of coherent radia- 
tion sources as shown in Fig. 4. Of course, this approach, 
just like that using the population inversion of multiply 
charged ions and nuclei, will require sufficiently high pow- 
ers. Production of high-power electron beams is however 
well established in experimental nuclear physics; therefore, 
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Fig. 4 Evolution of coherent sources of electromagnetic radiation 
from the viewpoint of shortening the radiation wavelength [58] 


we can expect much progress with the generation of coherent 
X-rays even in the current century through the happy mar- 
riage of the potentials of laser and nuclear physics. 


9. Conclusion 


Concluding this brief outline of the origins of quantum elec- 
tronics, its state-of-the-art and future trends and applica- 
tions, we could say that this science not only exploits sti- 
mulated transitions to produce coherent light, but it has also 
given birth to many research areas in other fields that rely 
on stimulated quantum transitions. Owing to quantum 
electronics, physics now has a new branch that is sometimes 
called laser physics and that can also be called the physics of 
stimulated quantum transitions. 
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Nonlinear Physics. 
Stochasticity and Structures 


A.V. Gaponov-Grekhov and M.I. Rabinovich 


l. Introduction 
1.1 Historical Outline 


In the earlier stages of fashioning the quantitative picture 
of the world virtually all the branches of physics (save, 
perhaps, for mechanics) confined themselves to looking at 
the most stable and ‘conspicuous’ fragments of this picture— 
equilibrium or quasiequilibrium states and processes. And it 
was only later that the natural course of events in science, 
of necessity, required confronting with fundamentally dy- 
namic problems—the transition from one quasiequilibrium 
state into another, the formation and disintegration of bound 
states (nuclei, atoms, stars, and galaxies), and so on. The 
variety and complexity of physical problems at the ‘dynamic 
level’ turned out to be so staggering that at first assump- 
tions of weak fields and perturbations, etc., appeared to be 
quite natural and were employed as virtually the only prac- 
tical possibility to get rid of quasiequilibria. This was the 
case in optics, acoustics, electrodynamics and most other 
physical disciplines. This state of affairs seemed to be there 
to stay, because experimentalists (specialists in optics, acous- 
tics, and so on) had for the most part available to them only 
fairly weak fields. 

As a result, the superposition principle, i.e. the notion 
that the additivity of causes brings about the additivity of 
effects, became so deeply engrained that many even treated 
it as a universal key to the understanding and quantitative 
description of the majority of problems posed by nature be- 
fore physicists. In handling such issues, which did not in- 
volve marked deviations from equilibrium states, physics 
was concerned with linear (or rather linearized) dynamic 
models. And although even in the earlier days the limitations 
of the ‘linear physics’ were sufficiently obvious, the linear 
approach lingered on as the dominating one in physics. Ap- 
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propriate mathematical concepts and attributes, e.g. spec- 
tral components, eigenfunctions of boundary problems, etc., 
were so natural and graphic that they even began to appear 
as existing independently of the way of description. Some 
experience of dealing with these ‘nearly tangible’ images not 
infrequently enabled one to foresee the result without solv- 
ing the problem. 

But early in this century the situation changed dramatic- 
ally—the number of nonlinear problems whose solution could 
not be shelved indefinitely began to snowball. Whereas ear- 
lier these problems were only related to traditional nonlincar 
mechanics (the three-body problem, the description of waves 
on fluid surface), in the 1910-30s nonlinear problems came 
to the fore in acoustics, solid-state physics and statistical 
physics. Problems of fundamentally nonlinear nature 
emerged in the incipient radio engineering (detection and gen- 
eration of oscillations), they also occurred in other applied 
problems (automatic control theory). But the ‘nonlinear 
difficulties’ in disparate branches of physics and technology 
seemed to be specific for each problem and unrelated to each 
other. 

It was only in the 1930s, mostly due to L. Mandelstam 
and his colleagues, notably A. Andronov, that it was recog- 
nized that the situations in which we do not observe an addi- 
tive response to additive impacts (or in which a knowledge 
of an arbitrary number of particular solutions is insufficient 
to predict the motion of a system), i.e. in which the linear 
approach is no good, are no exceptions, on the contrary, they 
abound in science. It became clear that nonlinear problems 
from various spheres of physics, and not only physics, ap- 
peared to be quite similar and in need of a common approach 
and description. Among physicists of various specialities 
‘nonlinear thinking’ started to strike roots, and various 
branches of science started pooling their ‘nonlinear expe- 
rience’ [1, 2]. It was at that time that Mandelstam, van der 
Pol, Andronov and others originated a science describing 
nonlinear phenomena in lumped parameter systems of var- 
ious nature—known as the theory of nonlinear oscillations 
(3, 4], or nonlinear dynamics or the theory of nonlinear phe- 
nomena. 

To most physicists in those days it seemed quite natural 
to translate to the new nonlinear problems the powerful, 
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both formal and heuristic, approach of linear physics. Sig- 
nificantly, appreciable advances were really possible along 
this path, when the superposition principle had been ‘im- 
proved’ in such a manner that the nonlinearity was taken 
care of either by small corrections or by slowly varying the 
parameters of the essentially linear solution. One of the first 
triumphs here was the construction in the 1920s by van der 
Pol of a theory of the quasiharmonic oscillator in the radio 
range using a triode [5] and Peierls’s theory of heat conduc- 
tion of crystalline bodies that took care of the anharmonicity 
of atoms in the lattice (see, e.g. [6]). 

Events took a similar course in the nonlinear dynamics of 
distributed parameter systems. Of course, here too nonlinear 
processes permitting of no quasilinear description were 
known to researchers (examples are shock waves [7], waves 
on water [8], etc.); but up to the mid-1960s they did not 
seem to be dominating the scene and so experts were chiefly 
concerned with weakly nonlinear phenomena. These phenom- 
ena, as a rule, could be adequately described using conven- 
tional linear images, e.g. packets of harmonic waves or qua- 
siparticles (photons, phonons, etc.) smoothly varying under 
the influence of the nonlinearity (self-action) or changing 
one another (interaction). The only effective processes here 
are those for which hold the laws of energy and momentum 
conservation for quasiparticles in each elementary event of 
interaction; for harmonic waves corresponding to the con- 


servation laws are the synchronism conditions je) Oj, 
k (o;) = ky. 

Problems describable in terms of weakly interacting lin- 
ear objects (harmonic oscillations, waves, packets, etc.) 
are the subject of ‘quasilinear physics’. In addition to the 
now classical problems of the nonlinear dynamics of lumped 
parameter systems (radio, automatic control [5, 9]) it yielded 
spectacular results, which embraced the key issues of non- 
linear optics (generation of optical harmonics [10], genera- 
tion of coherent optical radiation in lasers [11], light self- 
focusing [12, 13]) and are central to the ideology of nonlin- 
ear development of! plasma instabilities (decay [14] and 
modulational [15] instabilities, nonlinear Landau’s damp- 
ing and amplification [16]) and to the theory of weak tur- 
bulence in hydrodynamics [17] and plasma [48], and so forth. 
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Equations of quasilinear physics are normally formulated 
for parameters of weakly interacting elementary linear ex- 
citations, or modes (e.g. amplitudes and phases of quasihar- 
monic waves). To construct these equations means so to cor- 
rect (locally in time and space) the parameters of linear modes 
that in the approximate solution the corrections computed 
in terms of perturbation theory would not significantly 
affect its main part. The procedure is possible because the 
growing, or resonance (containing small denominators), 
corrections are those which coincide with one of the natural 
modes of the unperturbed problem.! But in those cases where 
it is impossible to include all the resonances, the quasili- 
near approach fails and the researcher finds himself con- 
fronted by a ‘strongly nonlinear’ problem. In the 1960s and 
1970s such problems were formulated and solved in general 
relativity, unified theory of fields, astrophysics and, of 
course, hydrodynamics. Production and destruction of ele- 
mentary particles, evolution of the Universe, random be- 
havior in simple systems without noise and fluctuations, 
laminar-turbulent transition in fluid flows, and especially 
the evolution of biological organisms could not be squeezed 
into the framework of quasilinear physics. 

It is worth noting that not infrequently truly nonlinear 
problems arise in quasilinear physics itself! After we have 
made use of the fact that the problem is quasilinear, we ob- 
tain a new, simpler, dynamic model, which however in the 
general case no longer contains small parameters. Examina- 
tion of this model beyond the immediate neighborhood of 
equilibrium states forms the subject of nonlinear physics. 
To be sure, this new approximate model can also happen to 
include small parameters and we can go through the proce- 
dure again (this circumstance is at the heart of Khokhlov’s 
method of step-by-step simplification of nonlinear equations), 
but in that case our encounter with a truly nonlinear problem 
is only delayed until the next level of description. 


! In the process the summation that extracts resonances from the 
various orders of perturbation theory leads not to a qualitative re- 
structuring of the linear part of the solution but only to a slow vari- 
ation of the parameters of the constituent modes (e.g. amplitudes 
and phases of the interacting waves). This summation is central to 
virtually all methods of ‘quasilinear’ physics—the Krylov-Bogolyubov 
method [19], the Feynman diagram method [20], and so on. 
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In this article we are looking at two, in our opinion, most 
general and fundamental nonlinear problems—the onset 
of stochastic, turbulent behavior in dynamic systems (not 
subject to fluctuations) and the emergence of deterministic 
structures (often despite fluctuations) in nonlinear media. 
These problems, dealing with weather forecast, plasma heat- 
ing, biophysics, unified theory of fields, evolution theory, 
etc., are now concerns not only of physicists but also of math- 
ematicians. It was mathematics that in recent decades has 
made outstanding discoveries, which have enriched physics 
in this field with rigorous results and images. The most strik- 
ing among them are the soliton and the strange attractor, 
and in a wider context—the connection between motion in- 
stabilities and the emergence of chaos. 

Solitons are known to be solitary nonlinear waves prop- 
agating at a constant velocity with their shape unchanged. 
They were discovered by J. Scott-Russell during his exami- 
nation of waves in a canal in 1834 [21]. The concept of the 
soliton was given its present-day meaning in 1965 by N. Za- 
busky and M. Kruskal [22], who coined the term ‘soliton’ 
and uncovered the remarkable properties of the stability 
of these nonlinear objects. 

At about the same time a theory was worked out that showed 
that a randomness may occur in a ‘nonrandom’ (dynamic) 
system. The theory is due to A. Kolmogorov [23, 23a], who 
for dynamic systems introduced the concept of entropy bor- 
rowed from information theory. The earliest indications about 
a relation between instabilities and statistics can be traced 
down to A. Poincaré [24]. Congenial ideas were then pro- 
nounced by Birkhhoff, Krylov, Born, and Hopf. In the early 
1960s, notably by D. Anosov [25] and Ya. Sinai [26], ma- 
thematical and physical models were constructed for sys- 
teins with unstable paths, and a decade later there came the 
‘strange attractor’ as an image of stochastic motions of dis- 
sipative dynamic systems (see [27]). 

Now these problems are ‘hot’ spots of nonlinear physics. 
In general, looking back one can say that after the second 
birth of the soliton, the discovery of strange attractors and 
the stochastic behavior of dynamic systems, the unity of mech- 
anisms and models of self-organization in systems and 
media of various nature there came into being a new field 
of learning—the theory of strongly nonlinear phenomena. 
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1.2 The Unity of Problems 


What is common in phenomena so seeiningly disparate as 
laminar-turbulent transition in flows, self-organization, 
and solitons? The commonness is quite comprehensive, but 
of the greatest importance here seems to be the fact that all 
of these phenomena ¢all for in a sense a global description of 
a system, i.e. a description ‘in the large’, which requires that 
the nonlinear system be described not only in the neigh- 
borhood of individual states. We will now proceed to ex- 
plain. 

A well-known example of self-organization is establishing 
the steady-state propagation of a flame front. In the process 
of deflagration, a relatively thin region in which the chemical 
reaction of oxidation takes place, i.e. a region separating 
cold fuel from the deflagrating products, moves relative to 
the fuel with a constant speed, independent of the initial 
conditions [28]. Corresponding to the one-dimensional wave 
front of deflagration is a particular solution of a system of 
ordinary differential equations for stationary waves f(& = 
x — vt). In phase space these solutions are represented by 
a special path, a separatrix, which connects two equilib- 
rium states (Fig. 1), one of which corresponds to the values 
of the variables before the front (§ — +-co, the reaction has 
not yet started) and the other after the front (§ —> —oo, 
the reaction has finished). Similar nonlinear solutions also 
answer stationary solitary waves, to which in conservative 
media correspond solitons. Only the separatrix representing 
a solitary wave goes over not into another equilibrium state 
but returns to the same state, from which emerges the sepa- 
ratrix loop (Fig. 2). In situations, where stationary waves 
are described by equations of order higher than two, a soli- 
tary wave can have a complex inner structure (see Fig. 20). 


Separatrix 


Fig. 1 Schematic representation 
of solving a system of differen- 
tial equations in phase space 
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Fig. 2 Solitary waves: (a) 


‘simple’ and (b) with a com- 
t plex inner structure 





To describe both solitons and flame front propagation (also 
the structure of the shock wave front) it is thus necessary to 
have a solution of a nonlinear problem describing the pro- 
cess of transition from one equilibrium state to another and 
therefore defined on an infinite interval of the variables (e.g. 
the coordinate E= x — vt), which at the same time is bound- 
ed. It was this that was meant by the global description. 

The relation of stochastic dynamics to the global nature 
of a description, to the behavior of a system ‘in the large’, 
and hence the need to examine the general structure of the 
system’s motion, is yet more of principle. This question 
requires a detailed discussion since here we have to do with- 
out concepts that are so habitual, natural and physically 
meaningful. 

In fact, when dealing with an ensemble consisting of a 
large number of weakly interacting ‘nearly linear’ oscilla- 
tors, in accordance with traditional thinking, we can a prio- 
ri expect a random behavior in such a quasilinear system. 
This assumption, reflected in a corresponding hypothesis, 
e.g. the one about the chaotization of phases of individual 
oscillators, enables us to use an averaged description, which 
is concerned with intensities (numbers of quanta) of oscil- 
lators—the theory of weak turbulence (see e.g. [17, 18]). 
In actual fact there is no strong nonlinearity here, nor is 
here an ‘internal’ dynamic stochastization. The mechanism 
of randomness is not clarified but only hidden behind the 
original hypothesis. 

The stochastic behavior of dynamic systems is by no means 
a consequence of a superposition of a large number of indep- 
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endent motions. This phenomenon is also not related to the 
widely held notion of stochasticity, which is associated with 
the presence of something approximate, indescribable, ‘truly 
random’. The very phrase ‘dynamic stochasticity’ still seems 
to some physicists to be inherently contradictory—after 
all, dynamics applies to a completely deterministic process 
of the evolution of a physical system, whose past and future 
alike are uniquely programmed by the original equations and 
the initial conditions selected. If the system is complex (con- 
sists of a large number of subsystems), the statistical ap- 
proach can well be admitted as a way of describing complex, 
but in essence not random, motions. And if a system is 
simple, e.g. two interacting nonlinear oscillators, is random- 
ness possible here? And if so, whence is it? The answer sounds 
surprisingly—yes, it is possible, and the motion, indistin- 
guishable from ‘truly’ random (e.g. under a random external 
force), is a special case of the motion of a deterministic strong- 
ly nonlinear system! Physically, as we shall see later, this 
implies the following: although each path of the system is 
deterministic in the above sense, all (or nearly all) finite 
paths are unstable ‘in the large’, which implies that there 
is no information about the past of the system (initial con- 
ditions), whatever the finite accuracy of specifying this 
past. 

Stochasticity, or random motion of a dynamic system, is 
thus a result of instabilities of paths that lie in the whole 
region of phase space—in a bounded phase volume (not only 
near equilibrium states). To prove the existence of such 
paths and to predict the statistical properties of the corre- 
sponding motions requires a look at the general structure of 
motions of the nonlinear system under consideration. Hence 
the true nonlinearity of the problem. 

In connection with these problems, which require an ex- 
amination of the behavior of a system ‘in the large’, physics 
seems to be slipping into essentially nonlinear language—the 
language of the qualitative theory of differential equations 
and the theory of dynamic systems. This process started back 
in the 1930s, when Andronov linked periodic self-oscillations 
in nonlinear nonequilibrium systems with the limit Poin- 
caré cycles. These days such concepts as, say, separatrix, 
periodic path, Poincaré mapping, etc. in physicists’ minds 
are associated with quite real images, and bit by bit these 
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techniques become powerful tools of intuitive analysis in 
nonlinear physics, as is spectral analysis in linear physics. 
By and large, today’s nonlinear physics is noted for a fusion 
of rigorous analytical methods with those of qualitative 
theory and numerical experiment. 

This article is an attempt to give to a fairly wide physical 
audience an idea of two main problems of nonlinear physics— 
the origination of ‘complex’ motions in simple systems and 
the emergence of ‘simple’ (ordered) structures in complex 
systems—if possible without the advanced mathematics, 
concentrating on the conceptual side of the matter, the natural 
cost of this being a certain wordiness. For the very same 
reasons the bibliography in no way claims to be complete, 
and, with a few exceptions, it includes reviews and mono- 
graphs. 


2. Stochasticity 
2.1 General 


The traditional view that science is called upon to search 
for simple relations and dependences in the involved and 
tangled picture of the surrounding world, and perhaps the 
natural striving for beauty and order, have long been re- 
sponsible for the fact that physics has focused all its atten- 
tion on relatively simple processes (motions). It seemed that 
precisely these processes are realized in an overwhelming 
majority of practically and fundamentally important sys- 
tems and only in ‘truly complex’ systems, ones with a large 
number of degrees of freedom, physics has (perhaps even 
because of the weakness of its mathematical tools) to resort 
to a statistical description. This thinking, as mentioned, 
was a product of the ideology of ‘linear physics’—the super- 
position principle admitted no interaction of perturbations, 
which were responsible for their stochastization. 
Individual pieces of evidence that sufficiently simple 
systems in addition to simple periodic behavior demonstrate 
a complex, and even irregular, behavior have been known 
for a long time [29, 30]. By virtue of the above-mentioned 
habitual thinking, however, they failed to attract a great 
deal of attention. As noted in Introduction, things have only 
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changed in a fundamental way during the past one and a 
half decade; mathematicians and physicists joined forces 
to work out clear concepts of stochastic motions and pro- 
cesses in simple dynamic systems. In the formation of these 
concepts dominant role has been (and is) played by nume- 
rical experiments and computer mathematics, which can 
yield rigorous results [31, 32]. 

Significantly, one of the first numerical experiments con- 
cerned the stochastization of nonlinear dynamic systems. 
Fermi, Pasta and Uhlam used a first-generation computer 
MANIAC-1 to demonstrate the behavior of oscillators with 
cubic nonlinearity. They anticipated equipartition of the 
energy stored at t = 0 in one of the first modes (thermali- 
zation). As luck would have it, the model they used, as trans- 
pired two decades later [33], turned out to be close to an 
exactly integrable system, and so they observed no ther- 
malization—the model demonstrated the recurrence phen- 
omenon (the energy first transferred to higher modes and 
then again returned to the initial one). This result was in 
contradiction with the then current notion that any system 
with a fairly large number of degrees of freedom must sto- 
chasticize under nonlinear interaction and thus pass into 
a thermodynamic equilibrium. Therefore, the negative re- 
sult was accepted as a paradox, the FPU paradox. The Fer- 
mi-Pasta-Uhlam result had great implications not only for 
‘stochastic dynamics’ but also for the soliton theory (see 
below, and also [34]). 

Recent years have seen a resurgence of interest in ‘sto- 
chastic dynamics’. This is because a good many specific 
problems have appeared in various fields and because there 
is the promise of progress in the fundamental problem of re- 
lating dynamics and statistical physics, two fields which 
once were contrasted. The stochasticity of the motion of 
simple dynamic systems is responsible, for example, for the 
losses of charged particles in accelerators and plasma traps 
[35] (similar things occur in the Earth’s radiation belts 
[36]). Stochastic heating (see [37]) and stochastic accelera- 
tion of particles by a periodic field [38] are made use of. 
It has been shown that proper-noise generators (e.g. solid- 
state ones) with a retunable spectrum are possible [39]. 
Nowadays stochasticity, as were periodic self-oscillations 
half a century back, is unearthed everywhere [38, 40, 41]. 
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Physically, a discussion of the stochasticity of dynamic 
systems boils down to answering the three main questions: 

(1) What mechanisms are responsible for the chaos in the 
phase space of a deterministic system, i.e. what is its 
‘mathematical’ nature? 

(2) How does chaos emerge on changing the parameters of 
a physical system? 

(3) How does established stochastic motion behave?_ 

To begin with, we will take a simple example. 


2.2 Example 


To see how nonlinearity gives rise to random motion in a 
simple (with a small number of degrees of freedom) dynamic 
system, we consider the motion of an electron in a field of 
two periodic waves. It is described by the equation 


ı+sinz= a — 2 sin (kx + ot) (1) 


If there are no second wave (A, = 0) and no losses (a2 = 0), 
i.e. the system is Hamiltonian, the electron is known either 
to undergo periodic oscillations in the potential well created 
by the wave (trapped electrons), or to undergo low-ampli- 
tude oscillations and to move relative to the wave (flying 


electrons). In the plane (z, z) flying and trapped paths of 
an electron are divided by a separatrix (Fig. 3), which is 


x 


Separatrix 


23 ESS’ EN 
CADIS 7 


Fig. 3 The phase portrait of a nonlinear oscillator—an electron in 
the field of a harmonic wave; the paths within the separatrix corre- 
spond to trapped electrons; those beyond the separatrix, to flying 
electrons 
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an image of finite motions with infinite period. How will 
the electron behave in the presence of a second wave? 

If this is a wave of a small amplitude, it will only ‚slight- 
ly deform the motion of both trapped and flying particles. 
And only those motions that lie on the boundary between 
these two types of motion (paths near the separatrix) appear 
lo be sensitive to weak perturbations produced by the second 
wave. In fact, having descended from a potential maximum 
produced by the first wave, the particle gets onto the next 
maximum with a nonzero velocity, since the second wave has 
imparted to it a momentum during this period. Depending on 
the sign of that momentum the particle can either fly over 
the potential barrier or reflect from it to return to the ini- 
tial position, i.e. to be trapped by the wave. The motions 
corresponding to transitions between the potential maxima 
(separatrices in phase space) appear thus to be substantially 
perturbed: even a small external perturbation can make trap- 
ped particles flying, and vice versa. 

To give a mathematical description of the particle motion 


we will represent the latter in space z, x, t by viewing the 
pictures at secant planes t, = const (ta = tọ + nT, n= 
1, 2, ...) identified through the period T = 2n/w. Paths 


in the plane x, x form from the points of successive intersec- 
tion by this secant plane of the phase paths proper in the 


(x, xz, t)-space.t At A, = 0 the system (1) is autonomous 
and the picture of the paths on the secant coincides with 
the phase portrait in Fig. 3. It is seen in Fig. 3 that the sep- 
aratrices of the saddle points a and b coincide (if we take 
into account that x is periodic, then the equilibria a and b 
are one and the same state and there is a coincidence of the 
stable and unstable separatrices of the saddle). 

For A, Æ 0 the following will occur. In the general case 
perturbation removes degeneration, and separatrices split. 
If they do not intersect, nothing of interest will happen. 
We are interested in the case that they intersect. It is easily 
seen that if they intersect once, on the secant they intersect 
an infinite number of times. The fact is that separatrices on 


the secant plane x, x consist of points that transform into 


1 This method is known as Poincaré’s mapping method. 
16—01171 
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ı Fig. 4 (1) Unperturbed separa- 

trix and (2) homocline] path in 
the neighborhood of the separa- 
trix on the Poincare secant; (3) 
the boundaries of the stochastic 
layer 





each other in a period 7. If apoint belongs to two separatrices 
at the same time, then all, of its! future images (that cor- 
respond to moments nT, where n — oo) and preimages (as 
n — —oo) must also belong to two separatrices at the same 
time. It follows that either separatrices do not intersect at 
all (save for the point corresponding to the initial equilib- 
rium, and in the presence of a periodic perturbation, to a 
periodic motion) or intersect at an enumerable number of 
points (Fig. 4). These points belong to doubly asymptotic 
paths, which were called homocline by Poincaré—each of 
them as t— +o winds up and unwinds on the initial pe- 
riodic motion. The paths lying in the neighborhood of a 
homocline path in phase space are referred to as homocline 
structure. The structure possesses an infinite variety of 
paths, some periodic, some random. A complete description 
of paths belonging to a homocline structure has been given 
relatively recently Se the 1970s) in the language of symbolic 
dynamics (e.g. [42]). 

It is remarkable that ‘random’ curves belonging to a ho- 
mocline structure can be realized not only in phase but also 
in conventional (coordinate) space. One example is magnetic 
en of force in the vicinity of splitting magnetic surfaces 
43]. 

If we trace the evolution of a drop of phase fluid in the 
neighborhood of a homocline curve, we note that it de- 
forms in a complicated way and as t — oo it ‘spreads’ over 
the entire structure. This stems from the local instability 
of nearly all the paths inside the structure—the points 
that at {= 0 have been arbitrarily close to one another 
separate with ¢. Such an instability of paths confined within 
a bounded phase volume is responsible for the complicated, 
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tangled motions, thereby making them indistinguishable 
from a ‘truly’ random one. As a matter of fact, the phase 
volume being bounded, nearly any open path will come ar- 
bitrarily close to itself in a sufficiently long period of time; 
but because of the instability it does not follow from this 
closeness that the next stage will be like the previous one. 
On the contrary, a small perturbation will grow and the 
further path of the system will be virtually independent of 
the previous sections, just as with ‘true’ random walk. 

This reasoning also suggests the manifestation of insta- 
bility mentioned above—it is impossible to reproduce the mo- 
tion of a system with unstable paths by specifying at a cer- 
tain time its location with an arbitrarily high, but finite, 
accuracy [44, 45]. 

Random aspects in the behavior of a ‘nonrandom’ non- 
linear system are thus associated with two circumstances: 
first, nearly each open path within a bounded phase volume 
is random in a sense; second, in a natural manner there ap- 
pears the concept of ensemble, so habitual for applications 
of probability theory. Here we have an ensemble of various 
sections of unstable paths within a bounded volume. It can 
be shown that with increasing time t = nT the number of 
types of path sections in such an ensemble grows in an ex- 
ponential manner: N (t) œ exp (ht). The exponent h is the 
characteristic of the degree of stochasticity of the nonlinear 
dynamic system akin to the notion of entropy in information 
theory (it is called the Kolmogorov-Sinai entropy). If h >O, 
the dynamics of the nonlinear system will be chaotic, at 
h = 0 the system will only display simple motions. Note 
that the term ‘simple’ includes not only periodic but also 
quasiperiodic motions possessing the property of ergodicity, 
i.e. the ergodicity of motion (a time average over the path 
is equal to an ensemble average for all the paths) is an insuf- 
ficient condition for randomness. 

We hope that the example just discussed has convinced 
the reader that the motion of a stochastic dynamic system 
(a system with unstable paths that wholly lie within a bound- 
ed volume of space) is indistinguishable from a truly ran- 
dom motion, say the Brownian motion of a particle. If there 
are any differences, they are just formal in nature. Suppose 
it is possible to specify many times the same initial condi- 
tions for a stochastic dynamic system and for a Brownian 
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particle. Clearly,:.in this thought experiment there will be 
a difference; the initial conditions being identical, the dy- 
namic system will of course have identical paths;(il is in 
this sense that the motion of the dynamic system will be 
completely determined); the motion of a Brownian particle 
will be irreproducible. This,difference, however, has no phys- 
ical meaning, since, it is impossible to reproduce initial 
conditions with infinite accuracy. But any arbitrarily small 
inaccuracy will result in an irreproducible ‚individual mo- 
tion of the stochastic system due to its instability. 

It should be stressed that the likeness of a path of a dynam- 
ic system lying in a bounded phase volume to the motion 
of a Brownian particle is not simply an analogy. There exist 
dynamic systems [46] such that for them a distribution of 
probabilities in any sufficiently large time interval will be 
reduced to the distribution of probabilities for a Brownian 
particle—the Gaussian distribution, which, it is well known, 
is a manifestation of the law of large numbers in probability. 


2.3 The Stochasticity of Hamiltonian Systems 


If there exists a homocline structure in the phase space of 
a Hamiltonian system, the latter is bound to be stochastic. 
But to prove the existence of such a structure in each spe- 
cific case is far from easy. If the system represents a weakly 
perturbed nonlinear oscillator (the above example of the 
motion of an electron in the field of two waves, one of them 
being much weaker than the other, comes exactly under this 
heading), we can find out whether or not there exists a homo- 
cline structure using again perturbation theory, but only sub- 
stantially nonlinear. In that case, for the unperturbed solu- 
tion we take the strongly nonlinear solution associated with 
the separatrix loop in the phase plane (see Fig. 1). There 
is a connection [47] between the existence of the structure 
and the existence of zeroes of the function A, (tọ) that char- 
acterizes the distance between perturbed separatrices (a 
small dissipation can also be viewed as a perturbation). 
If A, (to) is an alternating function, then in the phase space 
of a perturbed system there exists a homocline structure. 
Specifically, for a system represented in the form 


dz/dt = U (x, y) + eu (a, y, t). 9 
dy/dt = V (x, y) + ev (z, y, t), € <1, (2) 
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we have 

Ag (to) = | {u [xo (t— to), Yolt— to), t] 
xX V [zo (t— to), Yo (E — to)] —V [To (t — to), Yo(t— to), tl 
X U [x9 (t— to), Yo(t—to)]} dt. (3) 


Here zo (t — to), Yo (t — to) is the separatrix of an unper- 
turbed system from saddle to saddle, t, is the parameter char- 
acterizing the position of a point on that separatrix. 

In the example of the electronin the field of two waves, 
when the amplitude of one of them is far smaller than the 
other (see (1) at e = A,/A, <1), we get 


A, (to) = { 8a + i (810, sin 2098) 


x [cosh”! (w7) — sinh”! (ol) R 


u Without losses (œ =0) A, (to) 
has always an enumerable number of zeroes and if the per- 
turbing wave A, has an arbitrarily small amplitude a sto- 
chastic layer emerges. It is difficult, however, to observe 
(even im a numerical experiment) stochasticity at small e, 
since the width of the stochastic layer is proportional to 
IA, (to) 

If an electron moves in||the field of two waves whose am- 
plitudes’are of the same order (A, ~ A,), the region occupied 
by stochastic paths will dominate in the phase space. This 
is illustrated-in Fig. 5a, where the results of a numerical 
count are provided—the Poincaré mapping for the equation 


where w, = 0 (7 7%4:) 


T+ar—sinzsinot=0, (4) 


that describes the electron’s motion in the field of a stand- 
ing wave (at a = 0). It is seen that on the secant plane one 
path is represented by a random set of points (the Kolmo- 
gorov-Sinai entropy, which characterizes the averaged, over 
the stochastic set, degree of instability of paths, h ~ 0.0428). 
Regions without points correspond to ‘islands’ of regular 
dynamics. 
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Fig. 5 (a) Traces of phase 


paths on the secant x, x for 
equation (4) ata = 0 and 
(b) an example of the phase 
portrait (on the secant 
plane) of a Hamiltonian 
system with stochastic 
behavior 





This example illustrates the emergence of stochasticity 
in Hamiltonian systems, which are in a sense simpler to de- 
scribe analytically than dissipative systems (by virtue of 
the conservation of the phase volume). Note that the phase 
volume of Hamiltonian systems with stochastic behavior is 
nevertheless quite complicated in structure. It has alternat- 
ing stochastic and regular (periodic or quasiperiodic) re- 
gions of motion—the so-called separated phase space (‘is- 
lands’ of stability alternate with stochasticity regions; 
see e.g. Fig. 5b). In simple situations, as we have seen, a 
small perturbation results in forming a thin stochastic layer. 
But in systems with more than two degrees of freedom at 
arbitrarily small perturbations stochastic layers form a com- 
mon complex set (network) within which a specific insta- 
bility occurs—the Arnold diffusion. At large perturbations 
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(interaction energies) a developed stochasticity is ob- 
served—the stochastic motion is realized nearly for all initial 
conditions, i.e. the measure of islands of regular dynamics 
in the phase space is not large. 

For Hamiltonian systems it is possible to answer the ques- 
tion about the boundary of stochasticity in the parameter 
space (the criterion of overlapping of nonlinear resonances 
[41]), and also to examine the structure of the stochastic 
component in the phase space, specifically, to determine the 
rate of stochastic diffusion [48]. 


2.4 Strange Attractors 


At first sight the stochastic sets of Hamiltonian systems 
have very little in common with the behavior of dissipative 
systems (which are precisely the systems especially common 
in nature and engineering). We are used to the fact that, as 
a rule, dissipation makes a motion simpler and more 
‘smoothed off’, and as a result, as t —> oo, establishes either 
an equilibrium mode or a mode of regular oscillations.” At the 
same time, however, other speculations seem natural as well. 
It is argued, for example, that a periodic path of a Hamilto- 
nian system with an arbitrarily small dissipation and energy 
pumping to compensate for the dissipative losses can turn 
into a limit cycle, and the stochastic set of the Hamiltonian 
system in a similar situation may be not destroyed and may 
not vanish; instead, it may become attractive, i.e. become a 
stochastic attractor. Strictly speaking, this is not the case 
in reality—the islands of regular behavior available within 
the homocline structure contain periodic paths, which in the 
presence of dissipation (at least in systems with one and a 
half and two degrees of freedom) become stable cycles. To 
them, as {— oo, tend almost all the paths belonging to the 
neighborhood of a homocline structure; in other words, if 
a small nonconservativeness (dissipation plus pumping) is 
introduced, the chaotic dynamics, which corresponds to the 


2 This viewpoint had been widespread among most physicists 
and mathematicians up to the early 1960s. A hypothesis was even 
proposed that in ‘good’, or ‘coarse grain’, systems only equilibrium 
states and limit cycles can be images of limit motions (ones that 
establish as t— co). The term ‘coarse grain’ was introduced into 
qualitative theory by Andronov and Pontryagin [49]. 
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‘walk’ of the representative point over the homocline struc- 
ture, must be transient, i.e. unobservable as t > oo. 

Nevertheless, these intuitive concepts to a large extent 
appear to be correct. The fact is that areas of attraction 
of stable cycles are quite narrow and involved, so that even 
insignificant noise shifts the system into the region of tran- 
sient stochasticity, making it {virtually attractive, i.e. a 
stochastic attractor. As dissipation extends, the stable cycle 
can in principle merge with the unstable ones (there is an 
enumerable number of them in the neighborhood of a homo- 
cline curve) and vanish. Then in the phase space of the dy- 
namic system will emerge an attractive stochastic set, con- 
sisting essentially only of unstable paths and containing no 
stable ones. Ruelle’ and Takens dubbed this phenomenon a 
strange attractor. 

The possibility of existence of attractive sets that only 
contain unstable paths appears to be an absolute paradox. 
Indeed, this possibility suggests that unstable motions are 
observable at long times, including those as t > oo. But at 
the same time it would be quite reasonable to state that un- 
stable paths are of no interest physically, since even quite 
small perturbations make them unrealizable. The answer 
to the paradox is that observed are not whole unstable paths, 
but sections of unstable paths belonging to the. closed attrac- 
tive set. Now small perturbations can direct the system lo- 
cally along another path, but they are unable to change the 
‘coarse grain’ characteristics of the motion at long times, 
which are determined by the very fact of the existence of the 
attractor and its stochastic properties. Such a behavior is 
quite common in dynamic systems—one may encounter it 
in long-term ‘weather forecasts, ecology”and the history of 
a people. 

Most models of real physical systems have ‘unrigorous’ 
strange attractors, ones which contain stable periodic paths 
with so small attraction regions that in neither computer 
simulation nor real experiment one finds this periodic mo- 
tion. Furthermore, homocline structures are available in 
any strange attractor. 

Mathematically, the compatibility of instability of all 
paths in a stochastic set with the set being an attractor, 
which”attracts other paths in its neighborhood, can be ex- 
plained as follows. In an attractor constituent paths cannot 
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Fig. 6 Unbounded growth of the 
phase volume that only contains paths 
unstable in all directions 





Fig. 7 An example of a sad- 
dle path: direction I is sta- 
ble, direction II is not 1,10) 





0) 


be unstable simultaneously in all directions—this would 
make the phase volume accommodating the unstable paths 
unbounded (Fig. 6). But the attractor is localized in the 
phase space, i.e. its unstable paths lie in a bounded region. 
This is only possible when the paths are unstable in some 
directions but stable (attractive) in others—saddle paths 
(Fig. 7). If we now suppose that all the saddle paths have 
their attractive directions ipointing outside the attractor, 
then we can picture, although crudely, an attractive struc- 
ture constructed from unstable paths, i.e. a strange attractor. 

The system being slightly nonconservative, the strange 
attractor can be compared to the stochastic set of the origi- 
nal Hamiltonian system. As dissipation'increases, it generally 
becomes simpler and resembles the set of paths localized 
near certain surfaces. This is how looks like, for example, the 
strange attractor of the system 


dzldt = —ox + oy, 
dy/dt = —zz + rz — y, (5) 
dz/dt = xy — bz, 


in which ‘nonperiodic behavior’ was discovered by E. Lo- 
rentz [30] back in 1963 when he investigated thermal con- 
vection in a horizontal fluid layer heated from below. It 
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i (a) 


Ga 





Fig. 8 (A) A path numerically obtained [50] in the Lorentz system at 
+ = 10, r = 28, b = 8/3 and (B) an illustration of the Cantor struc- 
ture of attractors on the secant; (a) each path in the bundle, if viewed 
under magnification, turns out to be a set of paths separated by free 
space; further detalization repeats the picture 


appears that the paths lie on two surfaces going over into 
each other in a random fashion. In reality the attractor has 
the Cantor structure, i.e. it is no surface, but an infinite 
number of closely spaced ‘sheets’ (Fig. 8). Significantly, be- 
tween any two sheets there are spacings that do not belong 
to the attractor. 

In recent years a remarkable success in studies concerned 
with the stochasticity of dissipative systems has been the 
revelation and recognition of the fact that whatever the 
infinite variety of nonlinear physical systems, the number 
of common ways of ‘going irregular’ when parameters are 
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Fig. 10 Eevee of a strange attractor through an infinite sequence 
of period doubling bifurcations of the initial motion: (a) a picture in 
the phase space when the parameter varies; (b) transition to stochastic- 
ity, through period doublings, as a phase transition (the dependences 
of various characteristics of the behavior of the system on super- 
criticality near the transition point are described by critical indices: 
for the Kolmogorov-Sinai entropy the index is 0.449 [59], for the spec- 
tral line width—2.42 [60]) 


changed is quite small.’ And the transitions often appear to 
be common not only for systems of different nature and differ- 


3 A similar picture is encountered in periodic self-oscillations— 
the number of main bifurcations giving rise to a limit cycle is quite 
limited. Specifically, a cycle can be born from an equilibrium—the 
Andronov-Hopf bifurcation, which corresponds to the ‘soft’ birth 
of self-oscillations; a cycle can also be born from the loop of a separa- 
trix of a saddle, or from a closeness of paths in the phase space (rigid 
mode of onset of periodic self-oscillations). 
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Uti) (b) 





Fig. 11 re ge of stochasticity through intermittence: (a) approach- 
ing of a stable and an unstable cycle in the phase space; (b) oscillo- 
gram of stochastic oscillations near the transition point 


ent complexity, but also for lumped and distributed sys- 
tems—fields. This is illustrated in Fig. 9, where are shown 
experimental situations and bifurcations corresponding to 
the main transition types. This is an infinite chain of period 
doubling bifurcations (Fig. 10), emergence of stochasticity 
on approaching, further merging and vanishing of a stable 
and an unstable periodic motion (intermittence, see Fig. 11), 
stochastic destruction of quasiperiodic motions and ‘rigid’ 
onset of stochasticity. 

The transition via doublings is especially common. It 
occurs as follows. As a result of interactions of oscillations 
of the main period with perturbations of the doubled period 
the initial limit cycle becomes unstable, and in its neigh- 
borhood a stable cycle of doubled period emerges. There can 
be an infinite number of such period doubling bifurcations 
in a bounded range of parameters, with the result that in 
the phase space a strange attractor comes into being (see 
Fig. 10). Recently Feigenbaum [58] has discovered the uni- 
versal nature of the transition to stochasticity for an in- 
finite chain of period doubling bifurcations of the initial 
motion. Interestingly, a close analogy was here uncovered 
between the transition from periodic self-oscillations to 
stochastic ones and the theory of second-kind phase transi- 
tions. As in the theory of critical phenomena, the charac- 
teristics of nascent stochasticity are conveniently expressed 
in terms of critical indices [59, 60]. 
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For example, the parametrically excited nonlinear os- 
cillator 


r+ar-+ (1+ bcos Qt) c+ 23=0 (6) 


and convection in a fluid volume heated from below, described 
by partial differential equations, demonstrate similar tran- 
sitions to chaos. This is due to the fact that near a transition 
point a multidimensional (and even infinite-dimensional) 
dissipative system behaves as a system of low dimension. 
The mechanism of this phenomenon, which is another of the 
many illustrations of the unity of dynamic laws of nature, 
is associated with the action of dissipation, which deprives 
most modes of their independence, as a result of which only 
several groups of linked linear modes are effectively involved 
in the transition. These combinations are essentially new 
modes, nonlinear ones. It is to be stressed that in the major- 
ity of cases the low dimension of a problem does not show up 
if expansions in terms of the modes of the linear problem are 
used, i.e. if we stay within the framework of ‘quasilinear’ 
physics. This low dimension is purely dynamic in nature. 

The dimension of a model that adequately describes a 
transition is related to the dimension of the stochastic set 
in the phase space. Whereas in Hamiltonian systems the 
dimension of the stochastic set equals that of the space, in 
dissipative systems in which the phase volume contracts on 
average, the dimension of such a set—a strange attractor— 
can be much smaller, even fractional. The latter is in need 
of some explanation. 

As we have already said, on a bounded stochastic set all 
paths must be of saddle type. The rate of approaching the 
saddle path or running away from it of the neighboring paths 
depends on direction, and for each direction it is character- 
ized by the path-averaged Lyapunov index A; = (In l; (t)/ 
l; (0) )/t (see Fig. 7). Corresponding to the unstable directions 
are. positive A; (jf = 1, 2, ..., S); to the stable, negative 
Ay (S <j < m, where m is the dimension of the phase space). 
The system being dissipative (the phase volume con- 
tracts on average), the sum of the positive and negative Lya- 
punov indices, equal to the divergence of the phase flux, 
must be negative. In order to introduce the dimension of the 
stochastic set we arrange the indices in the decreasing order: 
Ay >À, >... >Àm and define the dimension of interest 
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to us [61] as D = M +d. Here M is the serial number of 
the characteristic index that is ‘boundary’; the sum of M 


M 
indices is yet positive, i.e. > à; >0, but the sum of M + 1 
j=1 


indices is negative already. The fractional coefficient d is 
found from the relation A, + ... + dAy = 0. Quantity d 
is called the fractional part of the dimension of the attractor. 
And D = M + d is associated with the Hausdorff, or frac- 
tal, dimension [62]. (The quantity D used here is an upper 
a of the Hausdorff dimension of the stochastic set 
[611.) 

It is seen that the dimension of the strange attractor de- 
pends not only on the number of unstable directions but also 
on the integral rate at which the paths run away along them. 

Physically, it seems of importance to find a relationship 
between the dimension of the stochastic set and the value of 
the parameter characterizing the degree of nonequilibrium 
(e.g. the Reynolds number of hydrodynamics). But so far 
there are only tentative, quite excessive, estimates in this 
respect. 

If an attractor has dimension D > 2, the phase paths 
constituting it lie within a thin layer near some surface. 
This quasibidimensionality of the attractor enables motion 
over it to be described approximately using the one-dimen- 
sional Poincaré mapping, which relates the coordinate of an 
intersection of a path belonging to the attractor (with the 
secant plane, see Fig. 8) to the coordinate of the next inter- 
section: 2,4, = f (z,). Attractors with (D — 2) <1 in- 
clude, say, the Lorentz attractor, the attractor of the sys- 
tem (6) with small pumpings, and many others. 

It appears clear now that a system of any dimension, such 
that the dimension of its stochastic set is only slightly more 
than two, must demonstrate precisely such transitions to 
stochasticity which are found in one-dimensional mappings— 
period doublings, multiplicability [51, 63], and so on. One- 
dimensional mappings adequately model attractors that are 
close to two-dimensional ones. 

Also, D characterizes the closeness of the strange attractor 
in a weakly dissipative system to the stochastic set of the 
corresponding Hamiltonian system [64]. Such closeness 
(including the one in statistical characteristics) takes place 
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Fig.'12 The phase”portrait of the stochastic set of system (6) on the 


secant x, z [64]; (a) Hamiltonian system (œ = 0), fractal dimension 3; 
(b) strange attractor (a = 0.01), fractal dimension 2.8 


when an attractor has dimension close to that of the phase 
space, i.e. at D < m. This is illustrated in Fig. 12, where are 
depicted the portraits of the stochastic set of the Hamiltonian 
system (6) at a =O and the strange attractor emerging 
from it (at a = 0.01 the attractor has a fractional dimen- 
sion of d = 0.8). 

Corresponding to the motion on a strange attractor are 
nondecaying stochastic pulsations of a dissipative system. 
Since these motions are ‘powered’ by nonrandom energy 
sources and are independent of the initial conditions, it 
would be natural to call them stochastic self-oscillations, in 
analogy with regular self-oscillations, whose image is the 
limit cycle. After the ‘coarse grain’ nature of stochastic dy- 
namics in dissipative dynamic systems has been established, 
physicists started encounting stochastic self-oscillations at 
every step, and their mathematical image—the strange 
attractor—becomes commonplace not only for mathemati- 
cians and physicists, but also for specialists in all domains 
involving nonlinear dynamic systems. 

Now stochastic self-oscillations are observed and studied 
in periodic stimulations of biological membranes [65], Jo- 
sephson junctions [66], autocatalytic chemical reactions 
[67, 68], lasers [69], hydrodynamic flows (see below), and 
so on and so forth. 
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2.5 Turbulence 


In nonlinear Hamiltonian dynamics physicists have long 
been ‘teased’ by the challenge of justifying statistical mech- 
anics, and in the theory of self-oscillations such a challenge 
has been the emergence of turbulence. The concept of 
turbulence as self-oscillations is due to Landau [70] who 
assumed that turbulence transition at increasing the Rey- 
nolds number consists in successively complicating regular 
pulsations in a nonlinear unstable dissipative medium— 
hydrodynamic flow. According to Landau, turbulence 
comes about as follows: at Re = Re, a limit cycle with fre- 
quency œ, is born, for Re, >Re, a mode with frequency 
@, becomes unstable and two-periodic self-oscillations ap- 
pear; next appear new frequencies, and so on. The complex 
self-oscillations that emerge as a result of such a chain of 
transformations—bifurcations—are turbulence. 

In point of fact, Landau proposed to describe turbulence 
by an attractor in the shape of a torus with a quasiperiodic 
winding. To be sure, such an attractor is more complex than 
the limit cycle, but nevertheless this is quasiperiodic mo- 
tion (its spectrum is discrete), which does not correspond to 
the ideas of turbulent flow formed from experiments. But 
if the number N of incommensurable frequencies in the spec- 
trum of the flow is large, its realization appears to be ran- 


dom—the autocorrelation function falls off fast (as 1/Y N), 
and the time to its next maximum (Poincaré’s recurrence 
period) is T oc exp (aN) (a = const). And so even at N = 
10 this model of the emergence of turbulence would be 
quite justified. The difficulty here lies in the fact that this 
kind of motion of a self-oscillatory system is generally un- 
stable—an open winding on an N-dimensional torus nor- 
mally already at N = 3 either closes, turning into the limit 
cycle (the effect of mutual synchronization) or the torus dis- 
integrates to give rise to a strange attractor—the image of 
the flow having a continuous spectrum. 

The discovery of strange attractors instilled hopes that 
the problem of the emergence of turbulence had at last been 
solved, at any rate for some of the flow types. Figure 13 
shows the results of experiments on turbulence emergence in 
so-called internal flows (thermal convection in cells and 
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Fig. 14 Variation of the dimen- 
sion of a strange attractor, cor- 
responding to the turbulent 
Couette flow between rotating 
cylinders, with the Reynolds 


2 
10 12 14 16 
number Re/Re¢r [73] Re/Reg, 





in the Couette flow between rotating cylinders). The bifur- 
cations on the way to turbulence that were observed in these 
experiments correspond to the bifurcations of the emergence 
of a strange attractor (see Fig. 9), i.e. it can really be said 
that we understand processes near the transition point. 

Now we have the answer to the question that may well be 
asked: how does turbulence develop beyond a critical point? 
In the case of ‘soft’ emergence of turbulence (corresponding 
to this is the birth at the critical point of a strange attractor 
of small dimension: D > 2) as the Reynolds number in- 
creases the turbulence process involves ever new degrees of 
freedom of the flow, and hence the dimension of the strange 
attractor, which is the image of this flow, must grow to- 
gether with supercriticality [54]. Recent experiments with 
turbulent Couette flows [73] and thermal convection [74] 
yielded spectacular confirmations of these concepts. As is 
seen in Fig. 14, corresponding to the emergence of turbu- 
lence in the Couette flow between cylinders is the birth of a 
strange attractor, whose dimension really monotonically 
grows with supercriticality (at Re exceeding Re,;—when 
turbulence starts—by no more than 30%, among the infinite 
number of degrees of freedom of the flow, or modes, turbu- 
lence involves no more than five modes). 


4 In the experiments under discussion [73, 74], from a sufficiently 
long realization of the velocity field u (t) at a point (i.e. from one 
observed point) was reconstructed the geometrical image (‘phase 
portrait’) of the flow that had established as ¢ > oo in an m-dimensional 
space (that it is possible in principle to achieve this reconstruction 
was shown by Takens [75]). A point in the m-dimensional space is put 
into correspondence to each of the vectors u (tp) = {u (tp), u(t, + 
Dur. U [tp + (m — 1) tl}, where t is fixed. A shift in k corre- 
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Strange attractors are related to the emergence of turbulence 
in free shear flows, e.g. in a boundary layer, not as di- 
rectly as lor flows in cells. If in closed flows turbulence de- 
velops in time (and one really can obtain the required in- 
formation about flows by measuring the velocity field at one 
point only!), in free shear flows turbulence normally 
develops downstream, i.e. chaos develops in space. This re- 
quires solving not just an initial problem but a boundary 
problem as well, and we are interested in the pulsating (irreg- 
ular in time) flow. This is an incomparably more complex 
problem. Nevertheless, we can well say that for such flows, 
too, turbulence emerges and develops due to the nonlinear 
dynamics of the flow and, as for simple systems, it is 
eventually due to the instability of individual motions. 
External pulsations are a sort of ‘trigger’ for this instabil- 
ity. One of the most exciting and important problems here 
is to relate the spatial development of turbulence down- 
stream to the growth of dimension of the stochastic set in 
the coordinate and to understand to what (in terms of cha- 
otic dynamics) corresponds the transition of the turbulent 
flow into the self-similar regime. Only first steps in this di- 
rection have been made so far [149]. 


3. Structures 


3.1 Order out of Disorder. Self-Organization 


Just as now we are astounded by the emergence in simple 
nonlinear systems of stochastic motions, so in the 1920-30s 
researchers were shocked to find ‘spontaneous emergence’ in 
dissipative systems (e.g. a nonlinear amplifier plus an RC- 
chain) of periodic motions. But even for a present-day 
researcher the emergence of regular oscillations out of ini- 
tial disorder (initial fluctuations) in the absence of organizing 
actions does not seem trivial. In fact, even when develop- 
ing instabilities are resonant in nature there appear not 


sponds to the motion along the path in this space. An invariant set of 
paths corresponds to the attractor. The dimension of attractor, D, 
starting with a certain mọ, should be independent of m. But if D varies 
for sufficiently large m as well, then the realization under considera- 
tion does not correspond to motion on a low-dimension attractor. 
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strietly periodic perturbations but a whole spectrum of such 
perturbations. What mechanisms are used to single out reg- 
ular perturbations out of arbitrary ones? What features 
must be inherent in such an ‘antientropy’ system, which 
transforms the energy of the external nonperiodic (and in 
general nonregular) energy source into periodic motions? 

Now essentially the same questions of the classical the- 
ory of nonlinear oscillations are posed in connection with 
seemingly absolutely peculiar phenomena—the emergence 
of spatial order out of initial disorder and the spontaneous 
formation of complex spatial structures in a homogeneous 
nonequilibrium medium. These phenomena came to the fore 
beginning in the 1950-60s in connection with the problems 
of chemical kinetics and biology. Specifically, a qualitative 
description was given of waves in the cardiac muscle 
(1954, [76]) and in the model of morphogenesis (1952, [77]), 
of periodic oscillations in autocatalytic chemical reactions 
(78, 79]. In those years a theory was proposed for regular 
spatial structures in some hydrodynamic flows (Bénard cell 
in thermal convection, see e.g. [80]; Taylor vortices be- 
tween rotating cylinders [81]). It was found quite soon that 
the development of complex ordered formations in nonlin- 
ear media or spatial ensembles of various nature is described 
by similar models and solutions, see [82-84]. This enabled 
one (as so often in nonlinear physics and the theory of 
oscillations and waves) to apply the experience and knowl- 
edge accumulated, for instance, in dealing with deflagra- 
tion reaction, to the analysis of the propagation of popula- 
tions in ecology or the propagation of excitations in cardiac 
tissue. As a result, new notions and images appeared (dissi- 
pative structure, auto-wave, reverberator, etc.) and basic 
universal models started taking shape that describe the 
emergence and existence of structures. As a matter of fact, 
a new direction in nonlinear sciences has made its appear- 
ance; it is called nonequilibrium thermodynamics [82], 
synergetics [83], or self-organization theory [85, 86]. 

The traditional physical example of self-organization is 
the emergence in a fluid layer heated from below of a struc- 
ture of hexahedral prismatic cells (Bénard cells, see Fig. 45). 
For such a structure to form it is of principle that the nonlin- 
ear medium be nonequilibrium and dissipative—the devel- 
opment of convective instability results in the growth of 
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Fig. 15 (a) Hexahedral prismat- 
ic cells emerging in a fluid 
layer heated from below and (b) 
the evolution in time of the cel- 
lular convection in the fluid 
layer [80]: at first the dominant 
regime has six cells, next a five 
cell regime sets in as a result 
of nonlinear competition 


perturbations of the velocity and temperature field in a 
certain interval of space scales, then because of the compe- 
tition of scales (only possible when there is dissipation) 
survives a lattice of a definite scale alone. Hexahedra are 
formed when lattices with different spatial orientations 
have their phases synchronized (Fig. 16).! Neither the scale 
of the lattice nor the structure of the cells is practically 
independent of the conditions prevailing on the periphery 
of the layer, if its size along the horizontal is large enough. 

In biological systems self-organization is observed in rel- 
atively simple situations, for example in a homogeneous 
ensemble of amoeba-like cells [87]. Such cells exude cyclic 


1 The synchronization is also possible in fluids, where viscosity 
(surface tension or other diffusion coefficients) is temperature de- 
pendent. 
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Fig. 16 (a) Emergence of hexa- 
hedral cells through the super- 
position of three spatial lattices 
of the same period rotated rel- 
ative to each other by 60° and 
(b) the lines of current in the 
fluid layer under Bénard con- 
vection 





AMP about once in 5 minutes. But with enough nutrients 
the cells do not respond to this hormone and fare indepen- 
dently. Under more unfavorable conditions the following 
takes place. One of the cells starts exuding the hormone at 
a higher rate and it synchronizes hormone release by its 
closest neighbors, which in turn synchronize their neigh- 
bors, and so on. After having been excited by this hormone 
the cell starts travelling in the direction of the exciter. The 
result is the counterflow of the divergent wave of the stimu- 
lator, or synchronization, and the motion of the cells to- 
wards each other and this virtually gives rise to a new orga- 
nism. 

What is then self-organization? Without any claim at a 
rigorous definition, we will say that self-organization means 
the establishment in a dissipative nonequilibrium medium 
of spatial structures (which, in general, vary with time), 
whose parameters are determined by the properties of the 
medium itself and depend but weakly on the spatial struc- 
ture of the source of nonequilibrium (energy, mass, etc.), 
the initial state of the medium and, quite often, on bound- 
ary conditions. It is thus of principle for 'self-organization 
to occur to lose any memory of the initial conditions and to 
have a direct link of the parameters of the structure with 
the properties of the dissipative medium. 
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As is seen from the example, self-organization is a result of 
the development of spatially inhomogeneous instabilities 
with their further stabilization due to a balance between 
inner dissipative losses and inputs of energy (mass, etc.) 
from a source of nonequilibrium. The picture is resemblant 
of the establishment of self-oscillations in an oscillator! The 
difference is that in the LC-oscillator with lumped para- 
meters ‘self-organization’ occurs in time only, whereas here 
both in time and in space. It is no wonder, therefore, that 
the examples of self-oscillations in distributed systems (the 
most popular of them is the generation of pulses in the laser 
[11]) are at the same time examples of self-organization. But 
since the phenomenon of self-organization also covers the 
formation of static structures and of propagating fronts 
(travelling structures), i.e. ‘nonoscillatory processes’, the 
whole range of related problems are now described by the 
generic term self-organization, not self-oscillations. 

Owing to the fact that self-organizing processes occur in 
time and space, they are exceedingly rich in form. They 
include dissipative structures, solitary fronts (deflagration 
fronts [88], population waves), impulses (in nerve fibres 
[89]), leading centres and reverberators in cardiac tissue 
[90], depression waves in the brain [91], etc. 

Returning to the questions posed at the beginning of this 
section, we can note that the basic mechanisms that prim- 
arily determine the possibility of self-organization and of 
the generation of periodic oscillations in nonequilibrium 
systems have been identified already in the frame of ‘quasi- 
linear physics’. These are the mutual synchronization and 
mutual competition of elementary perturbations. The com- 
petition effect consists essentially in that perturbations that 
have different frequencies and derive energy from the com- 
mon source of nonequilibrium of the system (an external 
field, an average flow, etc.) in the stage of nonlinear interac- 
tion organize for one another additional effective absorption 
with the result that a perturbation becomes dominant that 
is less sensitive to the action of the remaining ones (not in- 
frequently it is not this perturbation that grows faster than 
the others in the stage of linear development). On the other 
hand, synchronization ‘effect in the simplest cases is con- 
cerned with the establishment in the ensemble of interacting 
perturbations that have at first slightly different frequencies 
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Fig. 17 ‘Chemical’ turbulence: (a) an ensemble of concentric waves 
in a two-dimensional reactor [94]; (b) stochastic instability of a spiral 
wave in a two-dimensional excitable medium [95] 


(or space scales) of the resultant oscillation of a definite fre- 
quency (scale). And so the independent oscillation mode in 
the ensemble turns out to be unstable in relation to the self- 
consistency of perturbations, 

On the face of it the formation of structures against the 
background of initial noise is a phenomenon that might 
with good reason be set against the birth of chaos in a deter- 
ministic system. In actual fact, both chaos and order are 
a result of a nonlinearity. Chaos occurs in systems with 
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unstable behavior; and order in system with stable behav- 
ior. As we have seen in the previous section, the transition 
from a stable behavior to an unstable one can come about 
even when the parameters of a dynamic system change but 
slightly, i.e. when they pass through a critical point. More- 
over, even rather far from a transition boundary emerging 
chaos still carries imprinted on it signs of the vanishing or- 
der. Examples of ‘stochasticized’ ordered structures are 
coherent structures in turbulent flows [92, 93], ‘chemical’ 
turbulence in the form of an ensemble of concentric or spi- 
ral waves (Fig. 17) [94, 95], turbulence of Langmuir waves 
in the form of a ‘gas’ of solitons [96], and so on. It seems 
that to view structures (order) as the opposite of chaos is a 
bit too categorical. It would appear more natural to learn 
to describe various levels of order or various degrees of 
chaos. 

Structures, regular or chaotic, in nonlinear nondissipative 
media and in media with a dissipation in their properties 
turn out to be fundamentally different. Therefore, it makes 
sense to consider them separately. We will begin with dissi- 
pative structures. 


3.2 Structures in Excitable Media 


Among nonequilibrium dissipative media prone to self- 
organization (formation of structures) the most often occurr- 
ing in nature are the so-called excitable media (specifically 
in biology and chemistry). In comparison with conventional 
wave media they are distinguished by one feature of princi- 
ple—their constituent elements can only be found in several, 
qualitatively different, states. So these may be ‘excitation’, 
‘silence’ and ‘restoration’ (refractoriness) states. But links 
between elements (as a rule, dissipative) make possible a 
transition of a given element of the medium, under a per- 
turbation transferred from a neighboring element, from the 
initial state to the next one. It is this feature of excitable 
media that determines the structure of the dynamic systems 
describing them (differential equations). Suppose that the 
relaxation of a medium element from one state to another 
is described by the simplest model possible—the first-order 
equation 
dulat = f (u, a), 
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where f (u, œ) normally has a declining region (N-like char- 
acteristic). Then if we take into account the space-isotropic 
connection between such elements, we will arrive at the 
diffusion-type nonlinear kinetic equation 


ðulðt = f (u, a) + DAu. (7) 


In the general case, the diffusion coefficient D depends on 
u. If the medium under discussion is described by several 
variables, then u is a vector, and D (u) a matrix. Precisely 
such equations have been obtained by Hodgkin and Huxley 
[97] to describe the propagation of an impulse along a ner- 
vous fibre (the components of u here are electric field and 
concentrations of positive and negative ions along the fibre). 

If the processes in the excitable medium can be regarded 
as one-dimensional, (7) degenerates into 


öulöt = f (u, a) + Dö?ulda?. (8) 


This model was studied as early as the 1930s by Zel’dovich 
and Frank-Kamenetsky for the deflagration problem [98], 
and by Kolmogorov, Petrovsky and Piskunov [99], who de- 
scribed using (8) the propagation of epidemics. The trivial 
solution of (8) corresponding to a spatially homogeneous 
state of the system becomes unstable for a certain value of 
a, and an inhomogeneous distribution sets in. This may, 
specifically, be a stationary wave (flame) u = u (x — vt) 
propagating with a velocity v, which is higher than the 
diffusion rate. 

It is of principle that spatial structures in systems of the 
type (7) also emerge in those cases where each medium ele- 
ment is locally stable, i.e. stable is the trivial spatially 
homogeneous solution of (7) (Au = 0). Dissipative structures 
then develop as a result of diffusion instability [77] due to 
Turing. So in a two-component system of the type (7) 


du,/dt = au, — bu, — >} Dyjuj, (9) 
Ou,/dt = bu,—cu,+ > Dy juj, 
that is stable at D;; = 0, at Di2, Day < O there is always 
aer Such that for a >a,, nontrivial spatial structures will 
appear. 


It is the diffusion connection between the elements of an 
excitable medium that is responsible for the concentric waves 
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arising under certain initial conditions in a two-dimen- 
sional medium. Their structure is determined by the pres- 
ence of a leading centre that spontancously arises in the me- 
dium, the latter being stable in the absence of diffusion. 
This can be explained as follows. Each element is capable, in 
response to an external excitation at an appropriate phase, 
of producing an impulse; then it becomes silent. If two such 
elements are linked owing to diffusion, they can alternately 
‘trigger’ each other. As a result, the diffusion connection 
between two nonoscillatory (‘dead’ [100]) elements in the 
system gives rise to ‘life’—self-oscillations. 

Still more diverse are phenomena in systems where diffu- 
sion is sensitive to component concentrations—nonlinear 
diffusion. In particular, in systems in which nonlinear dif- 
fusion is determined by the concentration of ‘foreign’ com- 
ponents it is possible to observe a quite unexpected new 
phenomenon—spatial differentiation of interacting compo- 
nents. By way of example we take a two-component medi- 
um [101] 


Ou,/dt = — div Bat, Vs, 


Ouz/dt = — div BoU Vl, Pisze = const, (10) 
If the nonlinear diffusion coefficients have the same sign 
(Bas Big >> 0), it follows from these equations that there 
exists an instability of the spatially homogeneous distribu- 
tion, which, unlike Turing’s, should naturally be called 
forced [101]. For Bis, Ba >O, as an instability develops, 
various components concentrate within the same regions 
of space—the combination effect. But if Biz, Ba <0, the 
perturbations u, and u, have spatially opposite phases and 
as perturbations increase the various components are differ- 
entiated ever more until they stratify at a nonlinear stage. 

Even the basic, elementary phenomena of self-organiza- 
tion in two-dimensional excitable media still cannot be 
completely described. One example is spiral waves, observed 
in arrhythmias, in chemical and biological media. 

Spiral waves, as a rule, emerge in homogeneous media 
that are composed of diffusion-related active elements with 
a finite excitation time and finite restoration (refractoriness) 
time. The self-similar solution of (7), which describes a spi- 
ral wave in a two-dimensional medium, has the form u = 
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Fig. 18 A spiral vortex with topological charge 4. The picture shows 
its evolution in time with a period of 4 min (horizontal size of the 
field is 28 mm) [102] 


= F (N8 — ot, r), where O and r are polar coordinates, and 
N is the number of azimuthal mode (N is also referred to as 
the topological charge). The spiral wave (‘vortex’) with 
N >1 on a plane appears as a multiarm spiral (Fig. 18). 
Clearly, in a homogeneous medium there must exist left- 
and right-handed spiral waves, the symmetry being a con- 
sequence of the symmetry of the equations. The handedness 
of the spiral wave depends on the way of excitation or ran- 
dom initial conditions. Of especial interest here is the issue 
of selection of the possible stationary states (note that along 
with spiral waves excitable media can have cylindrical 
waves). For a strongly nonlinear medium, described by (7), 
the selection problem has not yet been solved. One can 
only hope, drawing on the experience of quasilinear physics, 
that here too selection is a result of the well-known effect 
of competition between nonlinear modes, which derive ener- 
gy from a common source—the dissipative excitable medium. 


3.3 Order Level 


Stationary fronts and impulses, concentric or spiral waves 
are examples of elementary, completely ordered structures 
observed in nonequilibrium dissipative media. There is ex- 
perimental evidence, however, for more complex nonstatio- 
nary structures (sometimes they can be viewed as a result 
of an interaction of elementary ones), for which it is quite 
difficult to draw a clear demarcation line between chaos and 
order. For example, take a look at the ‘chemical turbulence’ 
picture in Fig. 17, which is an ensemble of ring waves collid- 
ing with one another in a two-dimensional reactor in which 
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an aulocatalytic reaction proceeds. There is no saying deli- 
nitely which one finds more here, chaos or order. The same is 
true of the heart fibrillation by Rayleigh-Bénard nonstation- 
ary cellular convection, the reconnection of vortices in 
the Hele-Shaw convective cells [103], and so on. 

It seems quite unlikely that any universal parameter is 
possible that would enable the order level to be quantita- 
tively estimated in self-organized systems of arbitrary type. 
We will therefore confine ourselves to systems that consti- 
tute a nonlinear nonequilibrium medium bounded in space. 
Significantly, the majority of such media are characterized 
by a finite time t of restoration of the nonequilibrium prop- 
erty (refractoriness time). The propagation velocity V and 
time t being finite, in a nonequilibrium medium there al- 
ways exists a minimal volume with a characteristic scale 
loc Vt, within which spatial excitations cannot but be 
completely correlated. The number of such elements n in a 
spatially bounded system is finite (in many cases it is even 
not very large), therefore instead of an initial distributed 
system described by partial differential equations it is only 
natural to consider its finite-dimensional analogue—a dynam- 
ic system with N degrees of freedom, where N = np (p 
is the number of degrees of freedom for an elementary vol- 
ume, or a ‘point’ system). 

For such a dynamic system the phase-space counterpart of 
the structure that is a result of self-organization is the limit 
attractive set, the attractor. Simple attractors (equilibrium 
states and stable limit cycles) correspond to completely reg- 
ular structures; stochastic sets (strange attractors) to chaot- 
ic or partially ordered structures. The limit sets correspond- 
ing to the structures can have significantly difierent di- 
mension: zero—the equilibrium state, unity—the limit cy- 
cle, k—the open winding of a k-dimensional torus. Also differ- 
ent can be the dimension of stochastic sets (see Sec. 2.4): 
with negligible dissipation it virtually coincides with that 
of the phase space, and with sufficiently strong dissipation 
it can go down to about two. 

It is easily seen that it is the dimension of the limit set 
that characterizes the level of order of nonstationary struc- 
ture. In fact, it is natural to introduce as a parameter deter- 
mining the level of order a quantity that estimates the 
uniqueness of the connection between independent variables 
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describing the state of various elements of the system.” The 
situation in which with 2— oo all the variables appear to 
be interrelated by an algebraic dependence corresponds to 
the stationary, spatially bounded structure, called limit 
order. Corresponding to incompletely ordered structures is 
a situation in which as t > oo the relation between several 
variables remains a functional one in time, and so the di- 
mension of the limit set remains to be more than two. It is 
thus beyond doubt that the level of order is associated with 
the dimension D of the limit set embedded into the phase 
space of the dynamic system with N degrees of freedom. The 
case where the dimension of such a set is equal to the dimen- 
sion m of the phase space is precisely the case that corre- 
sponds to the completely disordered structure. As the dimen- 
sion of the limit set—the strange attractor’—decreases, the 
level of order of motion increases. 

As noted, the dimension D of the strange attractor can 
vary from two to m (the dimension of space). If for sufficiently 
large m the dimension of the attractor is D > 2 (the paths 
forming the attractor lie within a thin layer near a certain 
surface), then the majority of variables in a system with a 
high degree of accuracy are algebraically expressed through 
each other (local relation), and the level of order in a non- 
stationary structure, whose image is such an attractor, is 
high. At D ~ m in a dissipative system a regime is reali- 
zed with a maximally possible level of disorder; D = 1 cor- 
responds to the absolute order, or periodic structure (its 
image is the limit cycle). 

As one of the possibilities of the quantitative expression 
of the level of order of nonstationary structures we can con- 
sider the following combination of m and D: P = (m — D)/ 
(m — 1); in the limiting case of D— m the parameter of 
oe to zero, for D > 1 it can be close to unity 
[ ; 


2 Such elements may, for example, be eigenfunctions of a boundary 
problem—the spatial modes. 

® The limit set of formal interest to us may be either a strange 
attractor or an open winding of a multidimensional torus, but an 
attractor in the form of the open winding is not ‘coarse grain’, there- 
fore later in the article we will not discuss it. 

@ At first sight, the definition of P contains a significant measure 
of arbitrariness—as the description of a system becomes more detailed 
the number of degrees of freedom N, and hence the dimension of the 
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Fig. 19 Variation of the level 
of order of nonstationary struc- 
tures (solid line) and the dimen- 
sion of the attractor (dash line) 
with the coupling constant for 
oscillators (B = c=1.73, N=9) 
0 0.33 0.66 1 e [54] 





We will now show that structures of various levels of 
order can exist. Let us take a simple example: a model me- 
dium that is a chain of related active nonlinear elements [54]: 


da;ldt =a;— (1+ ip) Ja,]?a;+e(1— ic) (aj+ 
+aj4—2a)), B, c, e>0, (41) 


with periodic boundary conditions a; (t) = aj+ y (t), j= 
1,2, ..., N. Equations of the form (11) are to be found 
in many fields of nonlinear physics. Specifically, in the 
long-wave limit equation (11) becomes the well-known Ginz- 
burg-Landau equation [105], and a change a = u + iv in 
the same limit reduces the equation to the so-called 
(A — w)-system, which describes various self-organization 
phenomena [106]. The equation in the form (11) is used, for 
example, in modelling the dynamics of Taylor vortices in 
the Couette flow between rotating cylinders [107]. 

A numerical experiment has shown that in this system 
nonstationary structures are possible, whose images in the 
2N-dimensional phase space are limit cycles, two- and three- 
dimensional tori and strange attractors of various dimen- 
sions. Figure 19 shows the growth of the level of order of 


phase space, m, increase. And the level of order of a specific structure 
can artificially be brought up to the maximum value <1. This arbi- 
trariness, however, is a seeming one—the number of independent 
variables should include only those variables whose interdependence 
does not relax quickly to the algebraic one. If for times much shorter 
than those needed for the structure to establish, individual variables 
are algebraically related, then a priori they should be considered inter- 
dependent. 


chaotic structures in a ring 
chain with N = 9 as Ithe 
interdependence between 
elements of the ‘medium’ 
becomes stronger. Also 
represented in the figure is 
the dependence of the 
Hausdorff dimension of the 
strange attractors that cor- 
respond to these structures 
on the coupling constant 
for the oscillators. In this 
one-dimensional medium 
both substantially disor- 
dered and nearly regular 
nonstationary structures are 
possible (their spatial shape 
is shown in Fig. 20). Essen- 
tially disordered structures 
can emerge both evolution- 
ally (a chaotic structure 
transforms gradually, as a 
parameter varies, into a re- 
gular one), and abruptly, 
by steps (a regular structure 
is replaced by a structure 
that already has a low level 
of order). Structures were 
uncovered that represented 
a mixture of various forms 
of structures—intermittent 
structures with various spa- 
tial periods were observed. 

Significantly, the level of 
order of a nonstationary 
structure can remain virtu- 
ally unchanged as the num- 


Fig. 20 The growth of order in 
spatial structures as the para- 
meter e > 0.125 decreases (from 
the top downwards) in a ring 
chain with N=50 (B=c= 
1.71) [54] 
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ber of various elementary excitations (modes) contributing to 
its formation (e ~ 0.3-0.6 in Fig. 19) increases; but it can drop 
markedly as the number changes slightly (e ~ 0.2-0.3 and 
e =~ 0.7-0.8 in Fig. 19). Mechanisms of these phenomena are 
now known fairly well already [54, 55]. The case of the un- 
changed level of order of structures, as the number of inter- 
acting modes increases, corresponds to the situation where 
new modes completely or partially synchronize with old 
ones by just changing the shape of the existing nonlinear 
structures. But reducing the level of order, with the spec- 
trum of interacting modes unchanged, is associated with 
destruction of these structures. 

In unbounded nonequilibrium media stationary structures 
(ones that move at a constant velocity with their profile un- 
changed) can also have different levels of order. For one- 
dimensional media (simple space-time analogy) this seems 
especially natural. For instance, in the presence of an ex- 
ternal field of the form sin (wt — kx) waves in a one-dimen- 
sional excitable medium are described by the nonautono- 
mous diffusion equation 


du/at = f (u) + Dö?ulda? + q sin (Qt — kz). (12) 


In this case stationary waves propagate at v = Q/k and 
obey the equation of nonautonomous anharmonic oscillator 


Duss + vug + f (u) = q sin k$. (13) 


Depending on the parameters, this system can demonstrate 
all the forms of attractors possible, and so in the model (12) 
can be realized both regular and irregular running structures 
with various levels of order. 

Unlike nonstationary structures in bounded systems an 
image of stationary structures in the phase space (7) is not 
just the attractors but also the transient paths that ‘con- 
nect’ the sets that become limit ones, as &— — oo, with the 
attractors (attractive sets as § - +00). 

It is quite obvious that, there being a rich variety of struc- 
tures in nonequilibrium media, the P-type parameter of 
the ‘order level’ alone is insufficient to characterize the 
structures. Structure may also differ, for instance, in the 
degree of self-similarity, amplitude, ‘concentration’ of vari- 
ous forms of structures in their mixture, and so forth. In 
this connection, it appears to be of importance to derive 
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self-consistent equations for a complete system of parame- 
ters describing the evolution of structures in time. But vir. 
tually no results are available in this respect, however- 


3.4 Shock Waves and Solitons 


It is worth noting that nontrivial wave structures emerging 
in nonlinear media began to be studied in physics long be- 
fore self-organization became a topic of great interest. As 
early as the 1930s first studies on quasistationary shock 
waves in gases [108] appeared. In the 1950s shock waves were 
investigated in plasma physics [109] and magnetic hydrody- 
namics [110, 111]. Also studied were electromagnetic shock 
waves [112], for which macromotion of a medium and changes 
in thermodynamic states are of no principle. Now some 
works appeared on electromagnetic shock waves in a va- 
cuum.5 

Shock waves are regions of quick changes (‘jumps’) of pa- 
rameters of a nonlinear medium or field that arise as a 
smooth perturbation evolves. A well-known example is 
shock waves in explosions. At one time formation of such 
shock waves in a homogeneous medium under ‘good’ (suf- 
ficiently smooth) initial conditions came as a surprise (the 
phenomenon even came to be known as ‘gradient catastro- 
phe’). But already at about the same time, in the 1940s, the 
nature of this phenomenon was revealed. It was associated 
with the absence (or rather smallness) of dispersion in the 
nonlinear medium in which disruptions (jumps) of the va- 
riables occur. Under weak dispersion all the harmonic com- 
ponents of the initial perturbation travel with virtually 
the same velocities, and as ever new harmonics come into 
being energy is transferred upwards in the spectrum, thus 
giving rise to infinitesimal wavelengths corresponding to 
jumps of field parameters. In the language of space and time 
this means that in one interval of a wave its profile becomes 
more steep and eventually a discontinuity forms. In the vi- 
cinity of the discontinuity are felt even quite weak ‘high- 


5 In a vacuum appropriate states of the electromagnetic field are 
described now not by Maxwell’s equations, which are linear in a 
vacuum, but by various models of nonlinear field theory, which take 
into account interactions between photons when their density is high 
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frequency’ dissipation and dispersion. Their combined 
action is responsible for the establishment of stationary 
shock waves. 

On the front of the stationary shock wave variables are 
only dependent on one coordinate E = x — Vt, which trav- 
els with the velocity of the discontinuity. The structure of 
the front (the profile of the stationary shock wave) is indep- 
endent of the boundary and initial conditions. Such waves 
are described by nonlinear ordinary differential equations 
and therefore they can be investigated completely. Corre- 
sponding to the front of the shock wave in the phase space of 
these equations is the phase path (separatrix) between two 
equilibrium states (one corresponding to the state of the 
medium or field before the discontinuity, the other after the 
discontinuity). If the path is unique, then to given bound- 
ary and initial conditions there corresponds a unique discon- 
tinuous solution whose structure is determined precisely by 
this path. 

Situations are possible (e.g. in magnetic hydrodynamics) 
when certain initial data are satisfied by several discontinu- 
ous solutions. But only those are realized which depend con- 
tinuously on the initial and boundary conditions. Discon- 
tinuities for which weak perturbation of initial data causes 
a finite change of the solution are unstable. Naturally, dis- 
continuities, or ‘jumps’, that form as the slope of the pro- 
file of a simple wave keeps becoming steeper, always de- 
pend on initial data. 

The requirement that discontinuous solutions depend on 
initial and boundary conditions® coincide with the unique- 
ness conditions for the structure of a shock wave [114]. 
This remarkable thing explains why shock waves are stable 
whatever the structure of their front, which is determined by 
the details of dispersive and dissipative characteristics of 
the medium in the region of fast change of the variables. 
So if on the front the key role is played by energy losses, 
the’ wave’s profile will be smooth (Fig. 21a). But if the pic- 
ture is dominated by dispersion, and dissipation is insig- 
nificant, the front will be strongly indented—with small dis- 


6 It follows only from the boundary conditions at the discontinuity 
and is in no way related to the structure of the front of the shock 
wave. 
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Fig. 21 Stationary shock waves: (a) weak dissipation at the wave front; 
(b) the structure of the stationary wave front in a medium with dis- 
persion and negligible dissipation 


sipation the wave will consist of a sequence of nearly identi- 
cal and nearly independent pulses that follow one after 
another (Fig. 240). 

In the limit of infinitesimal dissipation the pulse spacing 
on the shock wave’s front grows indefinitely (and thus the 
very notion of the ‘wave front’ becomes meaningless); and 
the energy pumping to the pulse tends to zero. Such solitary 
pulses or waves are called solitons. Their counterpart in 
the phase space of stationary waves (described by a conserv- 
ative dynamic system) is the separatrix originating at a 
saddle-type equilibrium state and terminating at it as well— 
a closed loop of the separatrix (see Fig. 2). 

Solitons are an absolutely remarkable example of non- 
linear structures—they are quite stable in purely conserv- 
ative media, where there are no dissipation and external en- 
ergy sources. Their stability (see below) and dependence on 
initial and boundary conditions, as illustrated by transition 
from the stable shock wave, and their occurrence in a wide 
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variety of experimental. situations, are all in a sense re- 
sponsible for their ‘privileged’ role in nonlinear physics. In 
particular, during this decade attempts have been made to 
use solitons to describe strong interactions in quantum field 
theory [115]. It is solitons that are responsible for propaga- 
tion of dislocations in crystals and of magnetic flux along a 
distributed Josephson junction [116]. They make their ap- 
pearance as a result of self-focusing and self-modulation of 
waves [117]; they correspond to wave packets on deep water 
[148], and describe the behavior of the nonideal Bose gas 
with weak interaction between particles [119]. Models are 
now available that relate solitons to cyclones (anticyclones) 
in the dynamics of planetary atmospheres [120], to collec- 
tive excitations in molecular chains [121], and so forth. 

The basic model equations of soliton theory are the well- 
known nonlinear evolutionary equations: the Korteweg-de 
Vries equation, the Klein-Gordon equation, the nonlinear 
Schrédinger equation, and some others [122, 123]. Clearly, 
since these equations include high-frequency dissipation, 
they have solutions in the form of shock waves. 

A knowledge of solitons—elementary localized excita- 
tions of a nonlinear system—provides an insight into a great 
variety of physical phenomena (again using perturbation 
theory, but now at another level). It thus becomes possible 
to describe their weak interaction with one another and 
with nonsoliton excitations [124, 125], their propagation 
through inhomogeneous media, specifically in connection 
with tsunami [126], the behavior in external fields, and so 
on. In recent years interest has been focused on investiga- 
tions of the properties of an ensemble, or ‘gas’, of solitons. 
It is hoped that appropriate models will be of help in con- 
structing a theory of strong turbulence in a plasma [127], in 
describing various ‘strong excitations’ in solids [128], and so 
on. Some papers speak already not only about the ‘kinetics’, 
but also about the ‘thermodynamics’ of solitons [129]. 

Clearly, a perturbed solution in the form of an ensemble 
of interacting solitons can conveniently be constructed on 
the basis of stable solitons. Their stability, however, is not 
a necessary prerequisite; it is only sufficient that the life- 
time of solitons be longer than the characteristic time of the 
problem. 

Weak interaction of solitons is realized when at each in- 
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stant of time the total field differs but slightly from the 
superposition of fields of individual solitons. This is pos- 
sible if the energies E of solitons, and hence their velocities 
V, are similar, and throughout the interaction process the 
distance between centres of mass of solitons remains large. 
Equations for coordinates of solitons are similar to New- 
ton’s equations for particles, for which the interaction po- 
tential is determined by declining fields, or ‘tails’, of soli- 
tons. For two solitons we have dS/dt = f (V, S), where 
S is the soliton spacing, and f (V, S) x exp (—AS) + c.c. 
1130]. This model describes attraction and repulsion of so- 
litons, as well as formation of bound states. In the general 
case are possible bound states of N solitons—so-called mul- 
tisolitons [434]. 

Multisolitons are perhaps of principal importance in 
fundamental physics, specifically for quantum systems (ele- 
mentary particles [132]). So, the structure of nucleons with 
wave function > in a self-consistent field of nuclear forces 
with the Yukawa potential © is described by the equations 


—ip, +Ap+0p=0, AO—O= — |y]?. 


These equations have solutions in the form of solitons and 
multisolitons [131]. 

It is to be stressed that the dynamics of even one soliton 
in the field of other solitons or a periodic wave may be a 
complex one, and, in particular, stochastic. We will illus- 
trate this with reference to the soliton of the envelope of 
a high-frequency field interacting with a low-frequency wave. 
This interaction is described by the equations inp; + xx — 
np = 0, ny — Naxx = ( Pax, Where p is the complex en- 
velope of the high-frequency field, and n is the component 
of the low-frequency field. These equations describe inter- 
action of Langmuir and ion-acoustic waves in a nonisother- 
mic plasma [133], surface and internal waves in the sea, 
electron excitations in long spiral molecules, and so on. 
These equations have solution in the form of a soliton with 
a high-frequency ‘filling’.” When a low-frequency field is 


7 ‘Envelope waves’ are a most important and decply studied branch 
of the theory of waves. These waves are characterized by the many 
phenomena observed in the fields themselves. In particular, there is 
evidence for envelope shock waves [117], breaking, and so on. 
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smoothly inhomogeneous (in comparison with scales of the 
soliton), the soliton follows the variations of n (x, t) and 
its velocity and amplitude change slowly. With this formu- 
lation of the problem the soliton’s motion can be described 
using an asymptotic technique. As a result, we arrive at a 
model of the parametrically excited oscillator, which shows 
stochasticity [134], similar to that of a charged particle 
moving in the field of a standing wave. 

The existence of the soliton as an elementary object, i.e. 
its stability (which specifically suggests the adiabatic na- 
ture of the interaction of the soliton with the nonsoliton part 
(‘tails’) or external fields), depends on the kind of nonlin- 
earity of the medium, its dispersion law and the dimension 
of the space. In particular, one necessary condition of stabil- 
ity of solitons is that one of the integrals of motions (e.g. 
for energy) at the soliton must take on an extremal value (a 
corollary of Lyapunov’s theorem) [135]. Although no suf- 
ficient condition of stability has yet been formulated, it 
would obviously be of use to determine whether or not there 
exists an extremum of the integral in order to be able to 
predict an instability. 

For instance, in media with weak dispersion the propaga- 
tion of solitons of dimension higher than one is described 
by the Kadomtsev-Petviashvili equation [136] 


-Z (u, + uuy + Uyxx) = BHA u. (14) 


This equation, which is a generalization of the Korteweg-de 
Vries equation to the case of transverse diffusion, becomes 
the Korteweg-de Vries equation at x = 0. For x < 0 (neg- 
ative dispersion) equation (14) describes long gravitation- 
al waves on water, sound in solids, and so forth. The case 
of x >0 (positive dispersion) corresponds to gravitational- 
capillary waves, phonons in liquid helium and other waves 
with the so-called decomposition spectrumê—only waves 
with such spectrum form two- and three-dimensional soli- 
tons. In media with a nondecompositional spectra of waves 
one-dimensional solitons are simply stable in respect of 
transverse perturbations [136]. 


8 A spectrum is said to be decompositional if for waves of the 
same type the synchronism conditions hold: 


k, + k, = k;, o (kı) + o (k) = o (ks). 
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The soliton—a stationary solution of (14) of the form u = 
u (x — vt, rı), which falls off in all directions—is given 
by [137] 





0° = 
[v2 der + 3A 1 Ir |u=3 Ox? u?, (15) 


where v >0 (the existence condition for a localized solu- 
tion). Equation (15) can be written in the Hamiltonian form 
with the Hamiltonian (x = 1 


= | 3/243 (V1 w)/2— u] dr, 


where w, = u. Using scale transformations for a two-dimen- 
sional space we can show [138] that at the two-dimensional 
soliton the Hamiltonian attains its minimum, i.e. such a 
soliton may well be stable in respect of two-dimensional per- 
turbations. In a three-dimensional space there is no extre- 
mum, i.e. the soliton is unstable in respect of three-dimen- 
sional perturbations [138]. 

To get an understanding of the physics of stability of so- 
litons it is necessary to answer the question of how pertur- 
bations that distort the soliton ‘decay’ in a medium without 
energy dissipation. There are two typical mechanisms here: 
a perturbation either goes away from the soliton into the 
‘tail’ (radiation) or is absorbed by it, with the result that 
the soliton gets somewhat more energy, while remaining a 
soliton. In each specific case the problem consists in finding 
out which of these or similar mechanisms is at work. 

Concluding the discussion of solitons we note that in 
many nonlinear models in nonlinear physics solitons play the 
role that is in a way similar to that of harmonic waves in 
quasilinear physics. They can be used to construct suffi- 
ciently general solutions that enable one to understand and 
describe some strongly nonlinear phenomena, including ones 
in weakly nonconservative systems. Specifically, account of 
the fact that solitons can derive energy from external fields 
allows a description of the stable existence of soliton-type 
structures in viscous media. This is exactly the way in 
which the Great Red Spot on Jupiter is now accounted for: 
as an anticyclonic eddy (a two-dimensional soliton of the 
Rossby waves [120, 139] spinning about its axis in the oppo- 
site direction to the planet’s rotation). Such eddies are mo- 
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a). 


Fig. 22 Shear flow of a fluid in a 

vessel with a parabolic profile of the 
bottom that rotates around a ver- 
tical axis: (a) the velocity displace- 
ment corresponds to formation of 
cyclones; (b) the fourth azimuth- 
al mode, the velocity displace- | 
ment corresponds to formation of 
anticyclones; (c) third azimuthal 
mode, the velocity displacement 
corresponds to anticyclone forma- 
tion. The pictures show white test 
particles floating on the water sur- 
face against the background of the 
black bottom, taken using a rotat- 
ing camera. The white blot in the 
centre is the drive of the set-up [140] 


delled in laboratory, they are produced in a thin fluid 
layer on the surface of a rotating parabolic vessel. Because 
of viscosity anticyclones have a finite lifetime. But if the 
system has shear flows, such eddies emerge due to the devel- 
opment of shear instability (Helmholtz instability) and 
can already exist stationarily—viscosity losses being made 
NE the energy supplied from the mean flow (Fig. 22) 
[140]. 

Solitons in weakly dissipative media pumped by exter- 
nal energy sources are an example of auto-waves. These 
waves can be described using a new nonlinear perturbation 
theory, which is based on the well-known nonlinear solu- 
tion of unperturbed (exactly integrable) system. In recent 
years, in connection with the most remarkable achievements 
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of mathematical physics—the discovery of inverse problem 
technique [123]—the possibilities of investigation into sys- 
tems close to exactly integrable ones have become ex- 
ceedingly wide. 


4. Conclusion 


The last two decades have given to nonlinear physics very 
much, yet its basic problems still await their solution. Among 
these are the description of nonlinear quantum systems and 
fields, the assessment of possibilities of their stochastiza- 
tion, the formation of stable structures (particles), the analy- 
sis of nonlinear phenomena not covered by the models de- 
scribed using dynamic systems (for instance, systems with 
self-adjustable structure), and many other problems, which 
of course include the already classical problem of turbulence. 
An understanding of turbulence mechanisms is virtually 
of no help for describing specific turbulent flows. It is un- 
clear, for example, what physical mechanisms are respons- 
ible for the establishment in a conventional turbulent 
boundary layer of precisely the logarithmic profile of velocity, 
what is the spectrum of noise emitted by a turbulent jet, 
and so forth? These and related problems are difficult be- 
cause real turbulent flows are inhomogeneous in space and 
generally are so dissimilar to homogeneous isotropic turbu- 
lence (which alone has a complete description [141]) that 
solutions obtained cannot even be used as zero approxima- 
tions, which could be employed as starting points for a per- 
turbation theory. Paradoxically, not infrequently approxi- 
mate theories draw on solutions corresponding not to 
the turbulent regime but to the regular regime, which is the 
precursor of the former. This solution at any rate takes into 
consideration real features of the turbulent flow in question 
(velocity profile, boundaries, etc.). This approach appears to 
be fruitful also because many flows, even fairly far beyond 
the turbulence transition point, turn out to be incompletely 
chaoticized; they contain structures that are also inherent in 
the preturbulent regime, namely coherent vortex structures 
[93]. The most popular example here is the recurrence of 
the Karman vortex street in a developed turbulent wake after 
a cylinder (at Re = 10°) [142]. 
It may well be assumed that the secondary structures 
emerge against the background of small-scale turbulence in 
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the wake, just asthe Kärmän street does in the laminar flow 
[143]. Extensive scientific research on the dynamics of co- 
herent structures in turbulent flows brought about a resur- 
gence of interest in classical [143, 144] problems of vortex 
interaction [145-147]. Attempts have been made, in particu- 
lar, to construct a theory of turbulent shear currents by re- 
presenting the latter as an ensemble of structures (toroidal 
vortices, vortex ‘horseshoes’, etc.) that have random para- 
meters, in a manner like ‘soliton gas’ in a strong Langmuir 
turbulence in a plasma. 

The answer to the question of how a quantum system sto- 
chastic in the classical limit must behave is far from trivi- 
al. The point is that in quantum mechanics stochasticity is, 
strictly speaking, impossible [44]. The discrete nature of 
the spectrum of a closed system bounded in the phase space 
(and the discreteness of the phase space itself) implies that 
the movement of the quantum system can be either periodic 
or quasiperiodic. But at the same time there is the corre- 
spondence principle, which calls for a transition from the 
quantum system to a classical one, in which stochastic be- 
havior is already possible [41, 148]. The snag is removed 
[44] by introducing the notion of transient chaos. The cor- 
respondence principle only holds at finite times, therefore 
the motion of a quantum system and its classical counterpart 
should be compared not with t — oo (which would be quite 
natural for stochastic systems), but with some character- 
istic times. One nontrivial thing here is that the stochasticity 
of quantum systems is found at times much longer than the 
dynamic scale obtained from the Ehrenfest theorem with 
account of the fast smearing out of wave packets. 

It should be noted in conclusion that the two concepts of 
nonlinear physics—stochasticity and structures—that are 
given in the title of the article and are thereby sort of set 
against each other are in fact not antipodes in the general 
sense of the word. Chaos and order can, specifically, trans- 
form continuously into each other as a system’s parameters 
change. We would even be safe in saying that there is nei- 
ther absolute order nor absolute chaos—these are just limit- 
ing cases. But ary real system is always in some intermedi- 
ate state, and one should only estimate the degrce of close- 
ness of this state to one of the limiting states, i.e. to abso- 
lute order or total disorder. 


1. 
2. 


A 


SS @ N 9 © 


13. 
14. 


15. 
16. 


17. 
18. 
19. 
20. 
21. 
22. 


Nonlinear Physics. Stochastieity and Structures 285 


References 


L.I. Mandelstam, Lectures on the Theory of Oscillations, Moscow, 
Nauka, 1972 (in Russian). 

A.A. Andronov, ‘L.I. Mandelstam and the Theory of Nonlinear 
Oscillations’, in: Academician Mandelstam: To the 100th Anniv- 
ersary, Moscow, Nauka, 1979, p. 98 (in Russian). 


. A.A. Andronov, A.A. Vitt, and C.E. Khaikin, The Theory of 


Oscillations, Moscow, Fizmatgiz, 1959 (in Russian). 


. A.V. Gaponov-Grekhov and M.I. Rabinovich, ‘L.I. Mandelstam 


and the Modern Theory of Nonlinear Oscillations and Waves’, 
Uspekhi Fiz. Nauk, 128: 579 (1979). 


. B. Pol van der, ‘A Theory of the Amplitude of Free and Forced 


Triode Vibration’. Radio Rev., 1: 701 (1920). 


. E.M. Lifshitz and L.P. Pitaevskii, Physical Kinetics, Pergamon 


Press, 1981. 


. R. Courant and K.O. Fridrichs, Supersonic Flow and Shock 


Waves, Interscience, 1948. 


. T.B. Benjamin and M.J. Lighthill, ‘On Cnoidal Waves and 


Bores’, Proc. Roy. Soc. London A, A224: 448 (1954). 


. M.A. Aizerman, Lectures on the Theory of Automatic Regulation, 
10. 
11. 
12. 


Moscow, Fizmatgiz, 1958 (in Russian). 

S.A. Akhmanov and R.V. Khokhlov, Problems of Nonlinear 
Optics, Moscow, VINITI, 1964 (in Russian). 

V.M. Fain and Ya.I. Khanin, Quantum Radiophysics, Moscow, 
Sov. Radio, 1965 (in Russian). 

G.A. Askar’yan, ‘Impact of the Field Gradient of the Intense 
Electromagnetic Beam on Electrons and Atoms’, Zhurn. Eksp. 
Teor. Fiz., 42: 1567 (1962). 

V.I. Talanov, ‘Self-Focusing of Electromagnetic Waves in 
Nonlinear Media’, Izv. Vuzov, Radiofizika, 7: 564 a 

V.N. Oraevsky and R.Z. Sagdeev, ‘On Stability of Established 
Longitudinal Oscillations of Plasma’, Zhurn. Tekh. Fiz., 32: 
1291 (1962). 

R.Z. Sagdeev and A.A. Galeev, Nonlinear Plasma Theory, ed. 
T. O’Neil and D. Book, Benjamin, 1969. 

A.A. Vedenov, Ye.P. Velikhov, and R.Z. Sagdeev, ‘Quasilinear 
Theory of Plasma Oscillations’, Yadern. Sintez, Prilozhen., 3: 
1049 (1962). 

V.E. Zakharov, ‘Weak Turbulence in Media with Decay Spect- 
rum’, Zhurn. Prikl. Mekh. Tekh. Fiz., 4: 35 (1965). 

B.B. Kadomtsev, ‘Plasma Turbulence’, in: Problems of Plasma 
Theory, Issue 4, Moscow, Gosatomizdat, 1964, p. 188 (in Russian). 
N.N. Bogolyubov and Yu.A. Mitropol’sky, Asymptotic Methods 
in the Theory of Nonlinear Oscillations, Moscow, Fizmatgiz, 1963 
(in Russian). 

Sh. Ma, Modern Theory of Critical Phenomena, Benjamin, 1976. 
J. Scott-Russell, ‘Report on Waves’, Proc. Roy. Soc. Edinburgh, 
1844, p. 319. 

N.J. Zabusky and M.D. Kruskal, ‘Interactions of Solitons in 
a Collisionless Plasma and the Recurrence of Initial States’, 
Phys. Rev. Lett., 15: 240 (1965). 


286 


23. 


A. V. Gaponov-Grekhov and M. I. Rabinovich 


A.N. Kolmogorov, ‘A New Metric Invariant of Transitive 
Dynamic Systems and of Automorphisms of the Lebesgue Space’, 
Dokl. AN SSSR, 119: 861 (1958). 


23a. A.N. Kolmogorov, ‘Entropy per Unit Time as a Metric Invariant 


of Automorphisms’, Dokl. AN SSSR, 124: 754 (1959). 


24. A. Poincaré, Calcul des probabilités, Gautier-Villard, 1912. 


25 


26. 
27. 
28. 


29. 
30. 
31. 


32. 


33. 
34. 


36. 
37. 


38. 
39. 
40. 
4. 


42. 
43. 


. D.V. Anosov, ‘Geodesic Fluxes on Closed Riemannian Manifolds 
of Negative Curvature’, Trudy Mat. Inst. im. V.A. Steklova 
AN SSSR, 90: 3 (1967). 

Ya.G. Sinai, ‘Ergodic Properties of Scattcring Billiards’, Uspekhi 
Mat. Nauk, 25: 141 (1970). 

D. Ruelle and F. Takens, ‘On the Nature of Turbulence’, Com- 
muns Math. Phys., 20: 167 (1974). 

Ya.B. Zel’dovich and Yu.P. Raiser, Physics of Shock Waves and 
of High-Temperature Hydrodynamic Phenomena’, Moscow, Nauka, 
1966 (in Russian). 

T. Rikitabe, ‘Oscillations of a System of Disk Dynamos’, Proc. 
Cambridge Philos. Soc., 54, 1: 89 (1958). 

E. Lorentz, ‘Deterministic Nonperiodic Flow’, J. Atmos. Sci., 
20: 130 (1963). 

E.B. Vuland Ya.G. Sinai, ‘Computer-Aided Detection of Period- 
ic Paths of Dynamic Systems’, Preprint ITF im. L.D. Landau, 
1980 (in Russian). 

V.S. Afraimovich, V.V. Bykov, and L.P. Shil’nikov, ‘On Attrac- 
tive “Noncoarse Grain” Limit Sets of the Lorentz Attractor Type’, 
Trudy Mosk. Mat. Obshch., 44: 152 (1982). 

F.M. Izrailev and B.V. Chirikov, ‘Statistical Properties of a 
Nonlinear String’, Dokl. AN SSSR, 166: 57 (1966). 

M. Toda, ‘On a Nonlinear Lattice’, in: Solitons, Springer-Verl., 
1980, p. 143. 


. D.D. Ryutov and G.V. Stupakov, ‘Transverse Losses of Particles 


in Ambipolar Plasma Trap’, Pis'ma Zhurn. Eksp. Teor. Fiz., 
26: 186 (1977). 

V.D. Il’in and A.N. Il’ina, ‘A Change in the Magnetic Moment 
of a Charged Particle During Nonadiabatic Motion in a Dipole 
Field’, Zhurn. Eksp. Teor. Fiz., 72: 983 (1977). 

E.F. Jaeger, A.J. Lichtenberg, and M.A. Lieberman, ‘Theory 
of Electron Cyclotron Resonance Heating. 1. Short Time and 
Adiabatic Effects’, Plasma Phys., 14: 1073 (1972). 

G.M. Zaslavsky and B.V. Chirikov, ‘Stochastic Instability of 
Nonlinear Oscillations’, Uspekhi Fiz. Nauk, 105: 3 (1974). 
S.V. Kiyashko, A.S. Pikovsky, and M.I. Rabinovich, ‘Radio- 
Range Self-Excited Oscillator with Stochastic Behavior’, Radio- 
tekhn. i Elektr., 25: 336 (1980). 

M.I. Rabinovich, ‘Stochastic Self-Oscillations and Turbulence’, 
Uspekhi Fiz. Nauk, 125: 123 (1978). 

B.V. Chirikov, F.M. Izrailev, and D.U. Shepelyansky, ‘Dyna- 
mical Stochasticity in Classical and Quantum Mechanics’, Sov. 
Sci. Rev., Sect. C, 2: 209 (1981). 

L.P. Shil’nikov, ‘A Poincaré-Birkhhoff Problem’, Mat. Sb., 74: 
378 (1967). 

A.I. Morozov and L.S. Solov’ev, ‘Geometry of a Magnetic Field’, 
Vopr. Teor. Plasmy, 2: 3 (1963). 


44. 
45. 


46. 
47. 
48. 


49. 
50. 


51. 


52. 


53. 


54. 


55. 


56. 


57. 
58. 
59. 
60. 
61. 
62. 
63. 


64. 


Nonlinear Physics. Stochasticity and Structures 287 


N.S. Krylov, Works on Justification of Statistical Physics, Moscow, 
USSR Acad. Sc. Press, 1950 (in Russian). 

M. Born, ‘Vorhersagbarkeit in der Klassischen Mechanik’, 
Ztschr. Phys., 153: 372 (1958). 

Ya.G. Sinai, ‘Randomness of Nonrandom’, Priroda, 3: 72 (1981). 
V.K. Mel’nikov, ‘On the Stability of a Centre Under Time- 
Periodic Perturbations’, Trudy Mosk. Mat. Obshch., 12: 3 (1963). 
N.N. Nekhoroshev, ‘Exponential Estimate of Stability Time for 
Hamiltonian Systems Close to Integrable Ones’, Uspekhi Mat. 
Nauk, 32: 5 (1977). 

A.A. Andronov and L.S. Pontryagin, ‘“Coarse Grain” Systems’, 
Dokl. AN SSSR, 14: 247 (1937). 

O. Lanford, ‘Supplement to Article R.F. Williams: The 
Structure of Lorentz Attractors’ in: Lect. Notes in Math., Vol. 615, 
Springer-Verl., 1977, p. 143. 

V.S. Afraimovich and L.P. Shil’nikov, ‘On Some Global Bifur- 
cations Connected with Disappearance of a Fixed Point of the 
Saddle-Node Type’, Dokl. AN SSSR, 219: 1281 (1974). 

A. Arnesdo, P. Coullet, C. Tresser et al., ‘About the Observation 
of the Uncompleted Cascade in a Rayleigh-Bénard Experiments’, 
Paris, Preprint, 1981. 

A.I. Gol’berg, Ya.G. Sinai, and K.M. Khanin, ‘Universal Prop- 
erties for a Sequence of Period Tripling Bifurcations’, Uspekhi 
Mat. Nauk, 38: 159 (1983). 

A.V. Gaponov-Grekhov, M.I. Rabinovich, and I.M. Starobinets, 
‘The Onset and the Development of Chaotic Structures in Dissip- 
ative Media’, in: Selfwaves in Biology, Chemistry and Physics, 
ed. V.I. Krinsky, Springer-Verl., 1984, p. 147. 

V.S. Afraimovich, M.I. Rabinovich, and A.D. Ugodnikov, 
‘Critical Points and Phase Transitions in a Parametrically 
Excited Anharmonic Oscillator with Stochastic Behavior’, 
Pis'ma Zhurn. Eksp. Teor. Fiz., 38, 2: 64 (1983). 

C. Grebogi, E. Ott, and J.A. Yorke, ‘Crisis, Sudden Changes 
in Chaotic Attractors and Transient Chaos’, Physica D, 7: 181 


(1983). 

J. Crutchfield, D. Farmer, N. Packard et al., ‘Power Spectral 
Analysis of a Dynamical System’, Phys. Lett. A, 76A: 1 (1980). 
M.J. Feigenbaum, ‘Universal Behaviour in Nonlincar Systems’, 
Los Alamos Sci., 1: 4 (1980). 

B.A. Huberman and J. Rudnick, ‘Sealing Behavior of Chaotic 
Flows’, Phys. Rev. Lett., 45: 154 (1980). 

A.S. Pikovsky, ‘About the Behavior of the Strange Attractor at 
a Critical Point’, Izv. Vuzov, Radiofizika, 25: 846 (1982). 

F. Ledrappier, ‘Some Relations Between Dimension and Lyapu- 
nov Exponents’, Communs Math. Phys., 81: 229 (1981). 
B.B. Mandelbrot, Fractals: Form, Chance and Dimension, Free- 
man and Co., 1977. 

P. Berge, M. Dobois, P. Manneville, and Y. Pomeau, ‘Intermit- 
u in Rayleigh-Bénard Convection’, J. Phys. Lett., 41: L341 
(1980). 

F.M. Izrailev, M.I. Rabinovich, and A.D. Ugodnikov, ‘Approxi- 
mate Description of Three-Dimensional Dissipative Systems with 
Stochastic Behavior’, Phys. Lett. A, 68A: 321 (1981). 


288 


65 


66. 


67. 


68. 


69. 


70. 
71. 
72. 
73. 


74. 
75. 
76. 
77. 
78. 


79. 
80. 


81. 
82. 


83. 
84. 


A. V. Gaponov-Grekhov and M. I. Rabinovich 


. H. Hayashi, M. Nakao, and K. Nirokawa, ‘Chaos in the Self- 
Sustained Oscillation of an Excitable Biological Membrane 
under Sinusoidal Stimulation’, Phys. Lett. A, 88A: 265 (1982). 
V.N. Gubankov, K.I. Konstantinyan, V.P. Koshelets, and 
G.A. Ovsyannikov, ‘Chaotic Instability in Tunnelling Josephson 
Junctions’, Pisma Zhurn. Tekh. Fiz., 8: 1332 (1982). 

V.I. Krinsky and A.M. Zhabotinsky, ‘Auto-Wave Structures 
and Future Development’, in: Auto-Wave Processes in Diffusive 
Systems, Gorki, IPF AN SSSR, 1981, p. 6 (in Russian). 

R.H. Simoy, A. Wolf, and H.L. Swinney, ‘One-Dimensional 
Dynamics in a Multicomponent Chemical Reaction’, Phys. Rev. 
Lett., 49: 245 (1982). 

F.T. Arecchi, R. Meucci, G. Puccioni, and J. Tredicce, ‘Experi- 
mental Evidence of Subharmonic Bifurcations, Multistability 
and Turbulence in a Q-switched Gas Laser’, Phys. Rev. Lett., 
49: 1217 (1982). 

L.D. Landau, ‘About Turbulence Problem’, Dokl. AN SSSR, 
44: 339 (1944). 

J.P. Gollub and S.V. Benson, ‘Many Routes of the Turbulent 
Convection’, J. Fluid Mech., 100: 449 (1980). 

J.P. Eckmann, ‘Roads to Turbulence in Dissipative Dynamical 
Systems’, Rev. Mod. Phys., 53: 643 (1981). 

A. Brandstäter, J. Swift, H.L. Swinney et al., ‘Low-Dimension- 
ag) in a Hydrodynamic System’, Phys. Rev. Lett., 51: 1442 
(1983). 

B. Malraison, P. Atten, P. Berge, and M. Dubois, ‘An Experim- 
ental Determination for the Chaotic Regime of Two Convective 
Systems’, Preprint, France, 1983. 

F. Takens, ‘Detecting Strange Attractors in Turbulence’, in: 
Dynamical Systems and Turbulence, Lect. Notes in Math., Vol. 898, 
ed. D.A. Rand and L.-S. Young, Springer-Verl., 1981, p. 366. 
N. Wiener and A. Rosenblueth, ‘The Mathematical Formulation 
of the Problem of Conduction of Impulses in a Network Connected 
Excitable Elements, Specifically in Cardiac Muscle’, Arch. 
Inst. cardiol. Mex., 16: 205 (1946). 

A.M. Turing, ‘The Chemical Basis of Morfogenesis’, Philos. 
Trans. Roy. Soc. London B, 237: 37 (1952). 

B.P. Belousov, ‘Periodically Occurring Reaction and Its Mechan- 
ism’, in: Abstracts of Papers on Radiation Medicine, 1958’, Moscow, 
Medgiz, 1959, p. 145 (in Russian). 

A.M. Zhabotinsky, Concentration Self-Oscillations, Moscow, 
Nauka, 1974 (in Russian). 

C. Normand, Y. Pomeau, and M.G. Velarde, ‘Convective In- 
eb: A Physicist’s Approach’, Rev. Mod. Phys., 49: 591 
(1977). 

J.T. Stuart, ‘On the Nonlinear Mechanics of Hydrodynamic 
Stability’, J. Fluid Mech., 4: 1 (1958). 

P. Glansdorff and I. Prigogine, Thermodynamics of Structure, 
Stability and Fluctuations, Wiley-Interscience, 1971. 

H. Haken, Synergetics, Springer-Verl., 1978. 

Auto-Wave Processes in Diffussive Systems, Gorki, IPF AN SSSR, 
1981 (in Russian). 


85 


86. 


87. 


88. 


89. 
90. 


9. 


92. 


93. 


94. 


95. 


96 


97. 


98. 


99. 


Nonlinear Physics. Stochasticity and Structures 289 


. G. Nicolis and I. Prigogine, Self-Organization in Non-Equilibrium 
Systems, Wiley-Interscience, 1977. 
L.S. Polak and A.S. Mikhailov, Self-Organization in Nonequilib- 
pe Physico-Chemical Systems, Moscow, Nauka, 1983 (in Rus- 
sian). 
K.J. Tomchik and P.N. Devreotie, ‘Adenosine 3’ ,5’-Monophos- 
phate Waves in Dictyostelium discoideum: A Demonstration by 
Isotope Dilution-Fluorography’, Science, 212, 4493: 443 (1981). 
V.V. Barelko, I.I. Kurochka, A.G. Merzhanov, and K.G, Schka- 
dinskii, ‘Investigation of Traveling Waves on Catalytic Wires’, 
Chem. Eng. Sci., 33: 805 (1978). 
A.C. Scott, ‘The Electrophysics of a Nerve Fiber’, Rev. Mod. 
Phys., 47: 47 (4975). 
M.A. Allessie, F.I.N. Boake, and F.I.G. Schopman, ‘Circus 
Movement in Rabbit Atrial Muscle as a Mechanism of Tachycar- 
dia. III, The “Leading Circle” Concept: A New Model of Circus 
Movement in Cardiac Tissue Without the Involvement of an 
Anatomical Obstacle’, Circul. Res., 44: 9 (4977). 
V.I. Koroleva and G.D. Kuznetsova, ‘The Properties of Prop- 
agating Depression in the Presence of Two Potassium Centres 
in the Cortex of a Rat’, in: Electric Activity of the Brain, Moscow, 
Nauka, 1971, p. 130 (in Russian). 
A. Roshko, ‘Structure of Turbulent Shear Flows: A New Look’, 
AIAA Journal. 11: 1349 (1976). 
M.I. Rabinovich and M.M. Sushchik, ‘Coherent Structures io 
Turbulent Flows’, in: Nonlinear Waves: Self-Organization, 
Moscow, Nauka, 1983 (in Russian). 
C. Vidal and A. Pacault, ‘Spatial Chemical Structures, Chemical 
Waves: A Review’, in: Evolution of Order and Chaos, ed. H. Haken, 
Springer-Verl., 1982, p. 74. 
Y. Kuramoto and S. Koge, ‘Turbulized Rotating Chemical 
Waves’, Progress Theor. Phys., 66: 1081 (1981). 
. A.S. Kingsep, L.I. Rudakov, and R.N. Sudan, ‘Spectra of 
Strong Langmuir Turbulence’, Phys. Rev. Lett., 31: 1482 (1973). 
A. Hodgkin and A. Huxley, ‘A Quantitative Description of 
Membrane Current and Its Application to Conduction and Excit- 
ation in Nerve’, J. Physiol., 117: 500 (1952). 
Ya.B. Zel’dovich and D.A. Frank-Kamenetsky, ‘The Theory of 
(1388) Propagation of Flame’, Dokl. AN SSSR, 19: 693 
). 
A.N. Kolmogorov, I.E. Petrovsky, and N.S. Piskunov, ‘Analysis 
of the Diffusion Equation Connected with Increasing Amount of 
Matter, and Its Application to a Biological Problem’, in: The 
Theory of Deflagration and Explosions, Moscow, Nauka, 1981, 
p. 213 (in Russian). 


100. S. Smale, ‘A Mathematical Model of Two Cells via Turing’s 
Equation’, Lect. Appl. Math., 6: 15 (1974). 

101. V.I. Talanov, ‘Forced Diffusion and Cooperative Effects in 
Distributed Kinetic Systems’, in: Nonlinear Waves: Self-Organi- 
zation, Moscow, Nauka, 1983, p. 47 (in Russian). 

102. V.I. Krinsky and K.I. Agladze, ‘Vortices with Topological 


Charge 2, 3, and 4 in an Active Chemical Medium’, Dokl. AN 
SSSR, 263: 335 (1982). 


19-01071 


256 
103. 


104. 


105. 
106. 


107. 


108. 
109. 


110. 


411. 


112. 
113. 


114. 
115. 
116. 


117. 
118. 


119. 


120. 


121. 


122, 


123. 


A. V. Gaponov-Grekhov and M. 1. Rabinovich 


D.V. Lyubimov, G.F. Putin, and V.I. Chernatynsky, ‘About 
Convective Motions in the Hele-Shaw Cell’, Dokl. AN SSSR, 
237: 554 (1977). 

A.V. Gaponov-Grekhov and M.I. Rabinovich, ‘Nonstationary 
Structures—Chaos and Order’, in: Synergetics of the Brain, 
ed. E. Basar et al., 1983, p. 227. 

H.T. Moon, P. Huerre, and L. Redekopp, “Transition to Chaos 
in the Ginzburg-Landau Equation’, Physica D, 7: 135 (1983). 
N. Kopell and L.N. Howard, ‘Plane Wave Solutions to Reaction- 
Diffusion Equations’, Stud. Appl. Math., 52: 291 (1973). 

V.S. L’vov and A.A. Predtechensky, ‘Step-by-Step Transition 
to Turbulence in the Couette Flow’, in: Nonlinear Waves: Sto- 
chasticity and Turbulence, Gorki, IPF AN SSSR, 1980, p. 57 
(in Russian). 

R.D. Ray, ‘Plane Sound Waves of Finite Amplitude’, J. Acoust. 
Soc. Amer., 3: 222 (1931). 

R.Z. Sagdeev, ‘Collective Processes and Shock Waves in Rare- 
fied Plasma’, in: Problems of Plasma Theory, Issue 4, Moscow, 
Atomizdat, 1964, p. 20 (in Russian). 

S.I. Syrovatsky, ‘Magnetic Hydrodynamics’, Uspekhi Fiz. Nauk, 
62: 247 (1957). 

L.D. Landau and E.M. Lifshitz, Electrodynamics of Continuous 
Media, Pergamon Press, 1960. 

A.V. Gaponov and G.I. Freidman, ‘About Shock Electromag- 
netic Waves in Ferrites’, Zhurn. Eksp. Teor. Fiz., 36: (1959). 
V.B. Berestetsky, E.M. Lifshitz, and L.P. Pitaevsky, Quantum 
Electrodynamics, Moscow, Nauka, 1980 (in Russian). 

R.V. Polovin, ‘Nonlinear Magnetohydrodynamic Waves’, Differ- 
ents. Uravn., 1: 499 (1965). 

J. Brown and A. Jackson, Nucleon-Nucleon Interactions, Moscow, 
Atomizdat, 1979. 

R.D. Parmentier, ‘A Fluxon in Long Josephson Junction’, in: 
Solitons in Action, Acad. Press, 1978, p. 173. 

L.A. Ostrovsky, ‘Propagation of Wave Packets in Nonlinear 
Dispersive Medium’, Zhurn. Eksp. Teor. Fiz., 51: 1189 (1966). 
H.C. Yuen and W.E. Ferguson, ‘Relationship Between Benja- 
min-Feir Instability and Recurrence in the Nonlinear Schrédinger 
Equation , Phys. Fluids, 21: 1275 (1978). 

A.H. Luther, ‘Quantum Solitons in Statistical Physics’, in: 
Solitons, ed. R.K. Bullough and P.J. Caudrey, Springer-Verl., 
1980, p. 355. 

T. Muxworthy and L.G. Redekopp, ‘A Solitary Wave Theory 
of the Great Red Spot and Other Observed Features in the Jupiter 
Atmosphere’, Icarus, 29: 261 (1976). 

A.S. Davydov, ‘Nonlincar Oscillatory Phenomena in Biology’, 
in: Nonlinear Waves. Propagation and Interaction, Moscow, 
Nauka, 1981, p. 42 (in Russian). 

A.C. Scott, F.Y.F. Chu, and D.W. McLaughlin, ‘Soliton: A New 
Concept in Applied Science’, Proc. IEEE, 61: 1443 (1973). 
V.E. Zakharov, S.V. Manakov, S.P. Novikov, and L.P. Pitaev- 
sky, The Theory of Solitons: The Method of Inverse Problem, 
Moscow, Nauka, 1980 (in Russian). 


127. 
128. 


129. 


130. 


131. 


132. 
133. 


134. 


135. 
136. 


137. 
138. 


139. 


140. 


141. 


142. 


143. 


19* 


Nonlinear Physics. Stochasticity and Structures 291 


. D.W. McLaughlin and A.C. Scott, ‘A Multisoliton Perturbation 


Theory’, in: Solitons in Action, Acad. Press, 1978, p. 201. 


. V.I. Karpman and E.M. Maslov, ‘The Structure of “Tails” 


That Form as Perturbations Affect Solitons’, Zhurn. Eksp. 
Teor. Fiz., 75: 504 (1978). 


. E.N. Pelinovsky, ‘Tsunami Waves’, in: Nonlinear Waves. Prop- 


agation and Interaction, Moscow, Nauka, 1981, p. 187 (in Rus- 
sian). 

In Rudakov and Y.N. Tsytovich, ‘Strong Langmuir Turbu- 
lence’, Phys. Repts, 40C, 1: 1 (1978). 

V.M. Eleonsky, ‘Solitons in Magnetic Media’, in: Nonlinear 
Waves: Self-Organization, Moscow, Nauka, 1983, p. 129 (in 
Russian). 

S.F. Krylov and V.V. Yan’kov, ‘The Role of Solitons in Strong 
Langmuir Turbulence’, Zhurn. Eksp. Teor. Fiz., 79: 82 (1980). 
K.A. Gorshkov, L.A. Ostrovsky, and V.V. Papko, ‘Interaction 
and Bound States of Solitons as Classical Particles’, Zhurn. 
Eksp. Teor. Fiz., 71: 585 (1976). 

K.A. Gorshkov, V.A. Mironov, and A.M. Sergeev, ‘Bound 
Stationary Soliton Formations’, in: Nonlinear Waves: Self- 
Organization, Moscow, Nauka, 1983, p. 112 (in Russian). 

E. Witten, ‘Baryons in the 1/N Expansion’, Nucl. Phys. B, 
160: 57 (1979). 

V.E. Zakharov and A.M. Rubenchik, ‘About Nonlinear Inter- 
action of High-Frequency and Low-Frequency Waves’, Zhurn. 
Prikl. Mekh. Tekh. Fiz., 5: 84 (1972). 

I.S. Aranson, K.A. Gorshkov, and M.I. Rabinovich, ‘The Onset 
of Stochasticity in the Interaction of a Modulation Soliton with 
Low-Frequency Waves’, Zhurn. Eksp. Teor. Fiz., 86: 961 (1984). 
V.E. Zakharov and E.A. Kuznetsov, ‘About Three-Dimensional 
Solitons’, Zhurn. Eksp. Teor. Fiz., 66: 594 (1974). 

B.B. Kadomtsev and V.I. Petviashvili, ‘On the Stability of 
Solitary Waves in Weakly Dispersive Media’, Dokl. AN SSSR, 
192: 753 (1970). 

V.I. Petviashvili, ‘About the Equation of Unusual Soliton’, 
Fiz. Plazmy, 2: 460 (1976). 

E.A. Kuznetsov and S.K. Turitsyn, ‘On Two-Dimensional and 
Three-Dimensional Solitons in Weakly Dispersive Media’, 
Zhurn. Eksp. Teor. Fiz., 82: 1457 (1982). 

V.I. Petviashvili, ‘The Jupiter Great Red Spot and Drift Soliton 
in Plasma’, Pis’ma Zhurn. Eksp. Teor. Fiz., 32: 632 (1980). 
M.V. Nezlin, E.N. Snezhkin, and A.S. Trubnikov, ‘Kelvin- 
Helmholtz Instability and the Jupiter Great Red Spot’, Pis'ma 
Zhurn. Eksp. Teor. Fiz., 36: 190 (1982). 

A.N. Kolmogorov, ‘The Local Structure of Turbulence in In- 
compressible Fluid at Very Large Reynolds Numbers’, Dokl. 
AN SSSR, 30: 299 (1941). 

D.D. Papailiou and P.S. Lykoudis, ‘Turbulent Vortex Streets 
and the Entrainment Mechanism of the Turbulent Wake’, J. 
Fluid Mech., 62: 11 (1974). 

N.E. Kochin, I.A. Kibel’, and N.V. Roze, Theoretical Hydro- 
mechanics, Vol. 1, Moscow, OGIZ, 1948 (in Russian). 


292 


144. 
145. 


146. 
147. 
148. 
149. 


A. V. Gaponov-Grekhov and M. I. Rabinovich 


H. Lamb, Hydrodynamics, 6th ed., New York, Dover, 1932. 
E.A. Novikov, ‘Dynamics and Statistics of a System of Vortices’, 
Zhurn. Eksp. Teor. Fiz., 68: 1868 (1975). 

H. Aref and E.D. Siggia, ‘Vortex Dynamics of Two-Dimensional 
Turbulent Shear Layer’, J. Fluid Mech., 100: 705 (4980). 

S.V. Manakov and L.N. Shur, ‘Stochasticity in Two-Particle 
Scattering’, Pis'ma Zhurn. Eksp. Teor. Fiz., 37: 45 (1983). 
G.M. Zaslavsky, ‘Statistics of Energy Spectrum’, Uspekhi Fiz. 
Nauk, 129: 244 (1979). 

A.V. Gaponov-Grekhov, M.I. Rabinovich, and I.M. Starobinets, 
‘Dynamic Model of the Spatial Development of Turbulence’, 
Pis’ma Zhurn. Eksp. Teor. Fiz., 39: 561 (1984). ; 


Physics and Astrophysics 


in the Late 20th Century. 
Development 


and Future Outlook 


V. L. Ginzburg 


l. Introduction 


To what extent and for what time can one foresee and predict 
the development of science? And to what extent would such 
predictions be of interest and in what form are they to be 
made? To these and similar questions there are no clear and 
definite answers and a host of opinions are possible. The 
aim of this article is to elucidate and discuss the prospects 
of physics and astrophysics to the end of the 20th century; 
therefore, I cannot of necessity be too ambitious. The reader 
will not be offered a ‘ruling of a court of enquiry’, nor will 
he be given a program of actions involving administrative, 
financial and other aspects. Instead, he will find here notes 
and attitudes of one man (true, entertaining the hope that 
he has many sympathizers). At the same time I would not 
have written the article if I had not. considered a discussion 
of the paths of development of science an interesting and, 
at times, exciting pursuit. Moreover, this undertaking seems 
to be a useful and reasonable one (at least within certain lim- 
its), especially now that we witness a time of fairly drastic 
changes in forms of scientific endeavor. 

Not all hold this opinion, however; there are also quite 
negative views of this prospect. One argument is the unpre- 
dictability of the ‘creative process’, the unexpectedness of 
many discoveries, well known from science history. Well, per- 
haps the most interesting thing in physics and astrophysics 
(only these we are going to discuss in this article, although 
some general considerations are not inherent solely in some 
specific scientific disciplines)—is the enigmas and chal- 
lenges that present themselves in the course of research work, 
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Is it possible to produce high-temperature superconductors? 
Do there exist fairly long-lived atomic nuclei with atomic 
numbers Z > 110? Do there exist protoquarks (preons, 
etc.)? Is the lifetime of the proton finite, or the latter is 
stable? Is the mass of, say, electron neutrino zero? What 
occurs within the central regions of quasars and active ga- 
lactic nuclei, and within the nucleus of our Galaxy? Perhaps 
there is a massive black hole there, or even a double black 
hole? What occurs in the early phases of cosmological evolu- 
tion? (By the way, the very notion of the ‘early phase’ is 
here concerned with certain cosmological models, which may 
well turn out to be inadequate to describe the real Universe.) 

To all of these questions (which can easily be multiplied) 
we have no answers. What then is the meaning of ‘predic- 
tion’? It cannot be the drafting of a ludicrous plan containing 
phrases of the type: ‘By the year 2001 to find the lifetime of 
the proton’ or ‘By 2001 to produce high-temperature super- 
conductors,’ and so forth. Therefore, we will begin by clari- 
fying our task or rather our objective. 

The development of science brings forth a sea of ques- 
tions and problems. But not all of them are of equal signifi- 
cance, someof them surely stand out, they in some degree or 
other dictate, or, in any event, characterize the evolution 
of physics and astrophysics as a whole. We will refer to 
these problems as key ones. Distinguishing these key prob- 
lems is, to some extent, an arbitrary and difficult business. 
Suppose, however, that we still have a list of key problems 
in physics and astrophysics as of 1987, say. It would be quite 
natural then to ask: What progress can one expect in dealing 
with each of these problems? And how will a similar list 
look like in 2001? If we are successful in this undertaking, 
we will thereby make our assessment (supposition) about 
the prospects of development of physics and astrophysics 
by the end of the 20th century. 

We will now make some additional points. 

It is one thing to formulate a problem, to identify its place 
and significance, and quite another to indicate some ‘dead- 
lines’ for solving it. For example, in the 1920s came the 
issue of putting to use nuclear energy, produced when heavy 
nuclei were transmuted into lighter ones. It became clear 
that the heavier nuclei are potential storehouses of colossal 
amounts of energy. That fact stimulated the development 
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of nuclear physics and seemed to have given an impetus to 
many research projects. But even in the mid-1930s physi- 
cists saw no ways of using nuclear energy and made public 
statements to this effect, they were unwilling to recognize 
any real prospects of using nuclear energy. But then, in 
1939, the fission of uranium was discovered, and in 1942 the 
first nuclear reactor was already in operation. A chance dis- 
covery of nuclear fission was thus largely responsible for the 
quick change of the situation: the issue of uses of nuclear 
energy (as a result of fission) ceased to be a key problem— 
it had been solved. Further research efforts were rather con- 
cerned with applied physics and technology. Under other 
circumstances nuclear energy could have been listed among 
the key problems much longer. Anyway, when discussing a 
physics problem one should exercise great care assessing 
the time of its solution. 

No matter how fast physics and astrophysics develop, 
over a longer span of time, say decades, the development is 
slower than it sometimes appears to those involved. This 
is perhaps because the average age of physicists and astron- 
omers is not large, most of them have actively worked only 
10-20 years. And so it seems to them that the developments 
of 20-30 years back are almost ancient history. But workers 
of a more advanced age remember well the events of the 
early 1960s or even 1950s. It is absolutely clear to them that, 
even though recent years have seen some brilliant discover- 
ies and abundance of new evidence, it would not be true to 
say (with rare exceptions) that the physics of 1950 differed 
markedly from to-day’s. The special and general theories of 
relativity, nonrelativistic quantum mechanics, much of rel- 
ativistic quantum theory, relativistic cosmology, and many 
other sciences we are concerned with now, existed and 
‘worked’ back in 1950 and even before. Put another way, 
we should not underestimate the stability and continuity 
of the physical knowledge, at least on the scale of decades. 
This makes it especially clear that a discussion of the pro- 
spects of the development of physics and astrophysics to the 
end of the current century is not an extremely wild extra- 
polation. What is more, those mammoth modern set-ups— 
thermonuclear reactors, accelerators, space labs (observa- 
tories) and telescopes—normally take a decade, sometimes 
even more, to design and construct. . 
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Perhaps one may be surprised a bit by the fact that I 
here put together physics and astrophysics. Why not confine 
oneself to physics (the more so, as physics itself includes a 
good many disciplines, such as crystal physics, metal phys- 
ics, high-energy physics, etc.), or at the other end of the 
spectrum, why not include, in addition to astrophysics, 
geophysics and especially biophysics? First, I am insufficient- 
ly well acquainted with geo- and biophysics to write about 
them. Second, and this is the main thing, now (and obvi- 
ously to the end of the century) physics and astrophysics 
turn out to be especially closely linked. To write about phys- 
ics as a whole, about its evolution, and not to touch the gen- 
eral theory of relativity, cosmology, quasars, black holes, 
neutron stars, X-ray and gamma-astronomy, cosmic rays, 
neutrino astronomy and gravitational-wave astronomy 
would at present be impossible or at any rate artificial and 
unreasonable (see also Sec. 5). 

One more brief point before we go on to discuss the list 
of key problems in physics and astrophysics. I must mention 
that I compiled such a list in 1974 [1], and have been regu- 
larly updating it ever since. But early in the undertaking 
my objective was not prediction but some methodological 
approach. This question is important in its own right and 
from the viewpoint of the future development of physics, 
and so it is worth discussing in more detail. 

Physics has grown so that it is ever more difficult to ‘see 
the wood for the trees’. For this and other reasons speciali- 
zation in the physical community becomes ever more pro- 
nounced, with ever more researchers becoming narrow spe- 
cialists. But physics is united in its fundamentals, and the 
relations between various areas are strong both in essence 
and in techniques employed. An understanding of this unity 
and its ‘use’, so to speak, appears to be crucial; it helps in 
practical work, let alone the fact that it is so exciting and 
fascinating at least to trace the development of physics and 
astrophysics as a whole picture, not just of some narrow 
disciplines. The last point, it seems, would raise no objec- 
tion. But there is another, not so trivial a point: it is not 
really difficult to gain a smattering knowledge about all the 
new developments in physics and astrophysics as a whole or 
to keep track of everything. But here a young physicist, in 
particular a student of physics, needs some help. Where 
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should it come from and what sort of help should it be? It is 
impossible, of course, literally to ‘know something about 
everything’. Some sort of selection is required. And so I 
compiled a list of about 20 especially important and inter- 
esting (key) problems of physics and astrophysics and com- 
mented on it [1]. My comments sought to explain at a quali- 
tative level the gist of a problem (true, sometimes I give 
semiquantitative estimates), the state-of-the-art and future 
prospects for solving the problem. Article [1] swelled into a 
small book that has passed through several editions and 
been translated into many languages [2]. In 1981 I returned 
to the ‘list of key problems’ and attempted [3], going over 
the list, to discuss the developments in physics and astro- 
physics during the decade that had passed since the publica- 
tion of the list [1]. 

In this article too I decided to follow the above procedure 
to consider the evolution of physics and astrophysics till 
the end of the current century. This makes me a bit uneasy, 
for I sort of ‘rewrite myself’, a quite common abuse with 
many authors. To a certain degree this, of course, applies to 
me as well, since this article repeats some of what has been 
said earlier. I knew of no other way, however. It is well 
known that there exist cumbersome classifications designed 
for libraries, card indices, and reference books. Physics is 
broken down into a multitude of departments, sections, and 
subsections. But I just cannot imagine a way to comment on 
a mammoth catalog like that without writing a huge tome. 
And who would read that tome: experts would gain nothing 
by reading several lines about their field, nor would they 
be satisfied by a few comments on each of other fields (if 
there are hundreds of them). 

To sum up: what follows is an attempt to characterize 
the perspective of the evolution of physics and astrophysics 
to the end of the current century proceeding from a list of 
key problems. 


2. Key Problems in Physics and 
Astrophysics 


Putting aside comments and reservations we will just pro- 
vide the list on which we are going to base our reasoning. 
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2.1 Macrophysics 


. Controlled thermonuclear fusion. 
. High-temperature superconductivity. 
. New substances (production of metallic hydrogen and 


some other ‘exotic’ substances). 


. Metallic exciton (electron-hole) liquid in semiconduc- 


tors. Metal-dielectric transitions. Disordered systems. 
Spin glasses. 


. Second-kind (and related) phase transitions (critical 


phenomena). 


. Surface physics. 

. Behavior of matter in superstrong magnetic fields. 
. Very large molecules. Liquid crystals. 

. Rasers (X-ray lasers), grasers and lasers of new types. 
. Nonlinear phenomena. Solitons. Strange attractors. 
. Superheavy elements (far transuranium elements). 


‘Exotic’ nuclei. 
2.2 Microphysics 


Quarks and gluons. Quantum chromodynamics. 
Unified theory of weak and electromagnetic interac- 
tions. W+- and Z°-bosons. Leptons. 

‘Grand unification’. Proton decay. Neutrino mass. 
Superunification. 


. Fundamental length. High- and superhigh-energy par- 
16. 


ticle interaction. 

CP-invariance violation. Nonlinear phenomena in a 
vacuum in superstrong magnetic and electric fields. 
Phase transitions in a vacuum. 


2.3 Astrophysics 


Experimental checks and applicability of the general 
theory of relativity. 

Gravitational waves. 

Cosmological problem. Relation between cosmology 
and high-energy physics. 

Neutron stars and pulsars. Physics of black holes. 
Quasars and galactic nuclei. Formation of galaxies. 
Origins of cosmic rays and cosmic y- and X-ray radia- 
tions. 

Neutrino astronomy, 
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The grouping of the problems under the headings of macro- 
physics, microphysics and astrophysics is rather arbitrary. 
It is also arbitrary that we have 23 topics, not, say, 20 or 
30. The above listing does not mean to suggest that we 
should only address ourselves to those ‘key’ problems. It is 
simply that they stand out and attract a good deal of atten- 
tion (as reflected, of course, in special and popular-science 
literature) because they hold especial promise for technology, 
are especially mysterious, or any attempts at solving them 
run into insurmountable difficulties, and so forth. But what- 
ever the reasonable way in which the notion of ‘key prob- 
lem’ has been defined, it cannot inherently be eternal; and 
there are hardly two physicists who would compile identi- 
cal lists of key problems. More detailed reservations along 
these lines have been made in [2], where, as in [3], I called 
upon my colleagues to voice constructive criticisms and to 
compile ‘lists’ of their own, to come up with comments and 
to take part in discussions. My appeals met with no success, 
however. I do not know why. 

And so we start with a certain list of problems whose im- 
portance can hardly be denied by anyone. For us they must 
play the role of landmarks, or reference points. With refer- 
ence to them we will present ourselves with the question 
of the further advancement of physics and astrophysics up to 
the year 2001. To be sure, this year is of purely calendarical 
and psychological importance. At the same time it is con- 
venient because extrapolation over a span of 15 years is, it 
seems to me, quite acceptable and feasible. If the time period 
were 50 years (see, e.g. [4]), I would not feel capable of 
writing an article of this kind. 


3. Macrophysics 


Let us now start commenting on the problems in the list. 
An extensive bibliography is available in [2, 3], and so we 
will not clutter our story with references. 

Controlled thermonuclear fusion. This problem has been 
dealt with for 35 years now, and many hindrances have been 
removed, much has been understood. Yet it is too early to 
speak about the problem becoming a technological one. Even 
though tokamak-type (toroidal magnetic) systems have been 
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studied especially well, the construction of the so-called dem- 
onstrational tokamak is frought with many difficulties. 
Stellarator-type systems, open magnetic traps with ambi- 
polar mirrors (recent years have seen a surge of interest in 
these traps), and nonstationary systems with inertial con- 
finement (heating of ‘dust’ particles of a mixture of deute- 
rium and tritium using lasers, electron and ion beams) have 
not yet been studied in sufficient detail to warrant any pre- 
diction of their future prospects. It is quite probable that 
the aim of constructing a commercial thermonuclear reactor 
will be reached by several paths. The literature abounds in 
predictions as to the ways and times. One gathers an impres- 
sion that by the end of this century or at the very beginning 
of the next century will be built an experimental power re- 
actor of at least one type (most likely on the tokamak ba- 
sis). The nature of this article only allows a superficial and 
intuitive opinion: I am especially impressed by the novel 
(linear) magnetic systems (their principle is the simplest), 
and also inertial installations using ion beams (not lasers or 
electron beams). The value of such assessments is not large, 
of course. But in science emotions, enthusiasm and intuitive 
judgements are of no small importance. One has of necessity 
to conceal them in scientific papers, but it is hardly desir- 
able to do this here, just out of caution. In general I think 
that one should not be silent about something out of fear 
of being wrong. 

Thus, by the early 21st century the problem of controlled 
thermonuclear fusion will probably have been solved from 
the viewpoint of physics. This means that it will turn into a 
technological and engineering problem, although perhaps it 
will still retain some properties typical of principal physical 
problems. 

High-temperature superconductivily. Superconductivity was 
discovered in 1911, but even up until now its research fig- 
ures prominently in physics, to say nothing of technology. 
‘Achilles’ heel’ of superconductors is their relatively low 
critical temperature Ter (at T > Ter superconductivity van- 
ishes; furthermore, superconductors must as a rule be used 
substantially below Ter). The first discovered superconduc- 
tor—mercury—had Ter = 4.15 K. It was found in 1973 
that the compound Nb,Ge had Ter = 23.2 K, but ever since 
no superconductors with higher Ter have been discovered, 
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The boiling point T, (under atmospheric pressure) for heli- 
um is 4.2 K, and for liquid hydrogen 20.3 K. Therefore, only 
liquid helium, relatively expensive and scarce, is used in 
all superconductive devices. 

Roughly speaking, the high-temperature conductivity is- 
sue consists in seeking to create superconductors with as 
high Ter as possible, preferably with Ter > 80-90 K (for 
liquid nitrogen Tp = 77.4 K). This problem has been stud- 
ied actively since 1964, but only two results can be noted 
as yet. First, theory suggests that there should be no hin- 
drance to equilibrium superconductors with Ter < 300 K. 
True, for high Ter to be achieved, in any case are needed 
rather special, perhaps even hard to realize, conditions (para- 
meters of materials in question). Second, beginning in 
1978 evidence have been accumulated for the possible ex- 
istence of high-temperature superconductivity in CuCl (with 
impurities) and then in specially treated CdS. But even now 
the question is not yet clear completely both in the experi- 
mental and theoretical contexts. Barring any errors (which 
seems to be highly unlikely) what is observed is either high- 
temperature superconductivity or a new state of matter— 
superdiamagnetism (for more details and bibliography see 
[5]). 

Unlike controlled thermonuclear fusion research, studies 
of high-temperature superconductivity require no enor- 
mous set-ups. And so a breakthrough may come quite unex- 
pectedly, even from a small laboratory. But because I 
have been waiting for such a ‘breakthrough’ for 20 years 
now without any signs of success, understandably I can 
hardly be overly optimistic here. But what I am confident 
of, however, is that high-temperature superconductivity is 
really a key problem in physics that does not receive its 
fair share of attention. There is no guarantee of success, but 
it may well appear that even in the near future, so much the 
more by 2001, high-temperature superconductors will be 
created (see note added in proofs on p. 342). 

Metallic hydrogen. Under a pressure of 2-5 Mbar (more 
accurate estimates are unavailable, unfortunately) solid 
molecular hydrogen (under atmospheric pressure its melting 
point Tm = 14 K) must transform to metal phase. The 
structure and properties of metallic hydrogen are known, if 
only weakly (only theoretical estimates are available), but 
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there are good reasons for expecting that metallic hydrogen 
is a high-temperature superconductor with Ter ~ 100-200 K. 

At first sight the major obstacle in the way of producing 
metallic hydrogen may appear unexpected. To produce re- 
quisite pressures is relatively easy but there are no materi- 
als capable of sustaining without yield static pressures up- 
wards of about 1.7 Mbar (under such pressures in some exper- 
iments even diamond deformed or yielded). The answer is 
sought in two directions: manufacture from diamond of 
chambers (‘anvils’) for more than 2-3 MBar; and using shock 
waves (nonstationary compression). Under ‘shock’ condi- 
tions normally grows not only pressure but also temperature. 
Thermal problems can be circumvented, for instance, by 
using a ‘magnetic cushion’ (a gap ‘out of’ a magnetic field, 
which is a good conductor of pressure but not heat). I can- 
not predict future developments, of course, but I also can 
hardly imagine that within the further 15 years pressure 
in a chamber with the solid molecular hydrogen will not be 
brought up to 5 Mbar. But who knows .... 

I have already stressed that predicting deadlines is un- 
reliable when not considering specific set-ups already under 
construction. I do not possess all the relevant information, 
but I believe that, if metallic hydrogen studies are not dis- 
continued, metallic hydrogen will be obtained before the 
close of this century. 

Metallic exciton liquid in semiconductors. Metal-dielectric 
transitions. Disordered systems. Spin glasses. These questions 
have recently been receiving a good deal of attention. Me- 
tallic exciton liquid in semiconductors (a remarkable liquid, 
really!) has been obtained and many aspects of it have been 
studied. It is of interest, however, to explore the liquid in 
various ‘vessels’ (semiconductors), and also in the surface 
and linear (quasione-dimensional) forms. Under certain con- 
ditions exciton liquid in semiconductors can be dielectric, 
not metallic. To my knowledge, this form has not yet been 
studied. The same is true of the possible superconductivity 
and superfluidity of exciton liquids. 

Metal-dielectric transitions, disordered semiconductors 
and metals, spin glasses, the anomalous Hall effect are all 
favorite subjects in to-day’s solid-state physics. Work here 
goes on on such a wide scale, and my knowledge of the de- 
velopments in these fields is so inadequate, that I would not 
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dare a prediction here. But it seems natural that these prob- 
lems already by the end of the century will give up their 
places in the list of ‘key problems’ to some other problems. 

Phase transitions. Phase transitions can be termed the 
problem of the century (at any rate in macroscopic physics). 
This is especially so with second-kind and related transi- 
tions, and critical points. 

Two main areas can be singled out here: the general theory 
and analysis of the various specific transitions. For a fairly 
long time the behavior of matter near critical points and 
second-kind transition points, where fluctuations are large, 
baffled any attempts at theoretical analysis. But during the 
last two decades the situation has changed—methods have 
been developed that enabled the behavior of the matter to 
be described even in these cases (see, e.g. [6]). But some 
questions still await their answers (the behavior of inhomo- 
geneous systems, e.g. the behavior of helium II near the 
A-point in the presence of walls, flows, external fields, etc.). 
One is thus safe in saying (suffice it to get acquainted with 
the current literature) that the general theory of phase 
transitions continues to be in the highlight of the physics of 
condensed state. 

This is especially so with particular transitions distin- 
guished in some way or other and, specifically, with the 
transitions of *He to superfluid states (at T ~ 2 x 10-3 K), 
a number of transitions in liquid crystals, Bose-Einstein con- 
densation in gaseous atomic hydrogen in a magnetic field. 
Other highlights are transitions in magnetics and ferroelec- 
trics (with the so-called incommensurate phases taken into 
account), light scattering in the vicinity of phase transition 
points, the specific features of the A-transition in helium II, 
and so on. Without a doubt many phase transitions are 
going to remain of wide interest for many years to come. 

Surface physics. The various processes and phenomena 
occurring on the surface are, of course, by no means a new 
range of problems. But the last decade has seen drastic changes. 
We are now able, at least in certain cases, to achieve 
and control the purity and state of surface. Many efficient 
techniques have been developed or improved. Among them 
LEED (low-energy electron diffraction), ARPS (angular 
resolution photoemission spectroscopy), inelastic scattering 
of 1-MeV ions, electron microscopy, surface acoustic waves 
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and surface polaritons, and so forth. Many important re- 
sults have already been accumulated. For example, surface 
reconstruction, i.e. the development on the surface of a 
lattice with a period other than the bulk one (so, under cer- 
tain conditions in Si on the 111 face the lattice constant is 
seven times larger than in the bulk). Worthy of notice is 
surface ordering: surface ferro- and antiferromagnetism, 
ferroelectricity, superconductivity, and superfluidity. Even 
less exotic situations are impressive: in a dielectric (semi- 
conductor) surface electron levels can be occupied only par- 
tially, i.e. surface, unlike the bulk, will possess metallic 
conductivity. This circle of problems also includes transi- 
tions in two-dimensional and quasitwo-dimensional systems. 
This is a vast area of studies that is now on the rise. And I 
do not see any possibility of a decline up to the year 2001! 

Behavior of matter in superstrong magnetic fields. The Zee- 
man splitting of the levels of the hydrogen atom in a magnet- 
ic field Ey ~ eħH/me ~ 10-8 H eV (here the magnetic 
field H is given in oersteds or gausses, which is the same, 
since I use only the CGS system). At the same time in the 
absence of a magnetic field (or in a weak field) the characteris- 
tic distance between levels, determined by the Coulomb in- 
teraction between the electron and proton, is given by Ea ~ 
eim/2h? ~ 10 eV. Clearly, Ey < Ea, i.e. the field can be 
treated as weak, if 


3m2 2 
HER = (EV E met w 3x 10° Oo, 
In superstrong magnetic fields, i.e. when H >3 x 10° Oe, 
the field becomes a predominant factor, and the atom re- 
sembles a needle aligned with the field, i.e. it does not look 
like the normal hydrogen atom at all. For a heavy atom, 
whose nucleus has the atomic number Z, a field will be su- 
perstrong if H >3 x 10° Z® Oe. In laboratory only H <= 
10€ Oe can be reached, and so the issue of superstrong 
fields seemed to be an academic one. But soon after pulsars 
have been discovered (1967-68) it became clear that magne- 
tized neutron stars have magnetic fields of around 10*- 
1018 Oe. By the way, this was clear even before the discov- 
ery: as a well-conducting ‘conventional’ star with a radius 
r, and feld H, turns into a neutron star with r, and Hy 
the flux remains the same Hor} ~ Hyra. And so at ry © 
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101! cm and H, 10? Oe for a neutron star with r, ~ 
10° cm we obtain H, ~ 10!” Oe. 

It follows that matter in neutron stars and near them may 
be subject to superstrong magnetic fields. This circumstance 
is of significance for the theory of pulsars. A detailed 
discussion of the subject lies beyond the scope of the article, 
but even under terrestrial conditions we know that a magnet- 
ic field can dominate a Coulomb field. So for excitons in 
semiconductors (a bound electron-hole pair) the field is 
superstrong if H > 3 x 10°m2-/m%e? ~ 3 x 10° Oe (if for 
electron and hole the effective mass is mer ~ 0.4m, where 
mis the electron mass and € ~ 10 is the dielectric constant 
of the material). The field H ~ 3 x 10° Oe can be achieved 
even in laboratory. The behavior of matter in superstrong 
fields is in a way a special question. But this question is 
clearly of interest and it will attract a great deal of atten- 
tion, especially in the theory of pulsars and semiconductor 
physics. 

Very large molecules. Liquid crystals. These problems are 
different. They are related, for instance, by their importance 
for biology. Biology nowadays became the dominant disci- 
pline in natural sciences. The tendency for links between 
physics and biology to become ever more close will be pre- 
sent for years and even decades. 

Rasers, grasers and other new lasers. The 20th century is 
called not only the atomic or nuclear age, but also the laser 
age. As a matter of fact, even now the role of lasers is great 
and it is growing steadily. If inertial thermonuclear fusion 
is achieved using laser techniques, and if highly efficient 
and economical high-power lasers are created, this is bound 
to affect dramatically our way of life in the future. That is 
why the problem of development of new high-power lasers 
appears to be of importance for physics as well, although it 
is in the first place a technological problem. As for X-ray 
(rasers) and gamma-ray lasers (grasers), the emphasis is on 
the physical side now. It is possible in principle to make 
these ‘lasers’, but, judging from the evidence available, the 
sort of outlook for them does not appear attractive [7]. The 
latter is a fortiori so in relation to projects of using nuclear 
blast to pump rasers and grasers. 

Many devices can in principle be manufactured, but not 
all of them will necessarily be efficient and useful. The suc- 
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cesses of lasers are to a large extent due to their simplicity 
and accessibility. If rasers and grasers appear (and this is 
quite possible) to be exceedingly bulky and complicated, 
then, even if created, they will hardly fit into the fabric of 
our life. For some reason I have formed this attitude to these 
types of lasers. But the history of science knows of many 
cases where scepsis turned out to be unjustified and things 
changed dramatically after some discovery or invention. 
Anyway, the problem of rasers and grasers is still with us, 
it has not been solved yet, but it has not been written of. 
Obviously, further 15 years will decide the fate of these 
types of lasers, but it is not improbable that the problem 
will be bequeathed to the physicists of the coming cen- 
tury. 

Nonlinear phenomena. Solitons. Strange attractors. Non- 
linear physics, notably nonlinear optics, is no new thing. 
But recent years have seen a ‘boom’, or perhaps another 
boom. The vocabulary of physics came to be enriched with 
new terminology: soliton and strange attractor. (The term 
‘soliton’ is not all that young, however. But these days one 
encounters it more and more often.) This field (which can be 
referred to as nonlinear physics) is assuming prominence 
now, a trend that is no doubt characteristic of our days. 
Therefore, in 1982, perhaps somewhat belatedly, I included 
nonlinear physics in the list of key problems. It is hardly 
necessary to dwell on this topic here because it is the subject 
of one of the articles in this book. 

Superheavy elements (far transuranium elements). ‘Exotic’ 
nuclei. These topics are the concerns both of the physics of 
atomic nucleus and macrophysics. Classification is not a 
matter of principle here, of course. But still I think, skip- 
ping ahead somewhat, that nuclear physics as a whole, in 
its present-day status, should be classed with macrophysics 
rather than with microphysics. In fact, in nuclear physics 
we mainly deal with nonrelativistic particles (nucleons, elec- 
trons and, sometimes, positrons). In heavy nuclei there 
are fairly many nucleons, and many features of the nucleus 
make it akin to a drop of liquid. The nucleus has more in 
common with macrophysics, specifically atomic physics, 
than with high-energy physics, a typical representative of 
microphysics. But I reiterate: it is not classification that 
matters here. 
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My list includes only one problem from nuclear physics, 
also rather arbitrarily—a possible result of my lack of com- 
petence in this field. But the gist of the problem is the 
search for and study of ‘exotic’, yet unknown, nuclei. The 
latter include far transuranium elements, i.e. hypothetical 
nuclei with Z > 110-114 with a hightened stability (this 
relative stability is quite probable as a result of shell effects 
at Z = 114, N = A — Z = 184). The problem also covers 
nuclei with an unusual shape, hadronic atoms (e.g. the 
atom of a proton and an antiproton), nuclei with a higher 
density as compared with conventional nuclei (pp = 3 X 
404 g-cm~3), 

I myself is more interested in one area of exploration, 
namely in attempts to find exotic nuclei in cosmic rays. 
Specifically, if one isotope with Z ~ 114 were a long-lived 
(one with a half-life T > 107-108 years), such nuclei (which 
form, say, in supernova explosions) would be observed among 
the cosmic rays that reach the Solar system. In emulsions 
brought to or even beyond the boundaries of the atmosphere 
no reliable evidence for such nuclei have been found 
yet. In 1980 it was reported, however, that in olivine crystal 
of meteoritic origin a track of a nucleus with Z > 110 was 
found. But for the interpretation to be more unequivocal 
and in any case for any confident judgement to be possible 
several tracks will have to be found. And so the question is 
still open. 

Another area of quest for far transuranium elements is 
worth mentioning. Attempts have been reported to synthe- 
size them in accelerators, just as it has been done with the 
elements with Z < 109. 

Macrophysics, which embraces all the key problems just 
discussed, rests on a solid foundation of the special theory 
of relativity, nonrelativistic quantum mechanics, and quan- 
tum electrodynamics. True, nuclear physics in a way is an 
exception here, but this does not change the general picture. 
The presence of so solid a foundation by no means relegates 
macrophysics problems to a second-rate category as compared 
with microphysical ones. The opposite, i.e. belittling the 
importance of macrophysics, is as incorrect as thinking that 
the main thing for a dwelling or laboratory block is the 
foundation. Actually, it is impossible to imagine the edifice 
of physics not only without a foundation but also without 
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the rest. But it goes without saying that those scientific dis- 
ciplines that have a good foundation evolve in a manner di- 
fierent from those making their appearance on ‘virgin lands’, 
when it is necessary to construct a foundation. In the case of 
macrophysics it is the existence of a reliable foundation that 
frustrates expectations of many breakthroughs in the near 
15 years. It is to this that I attribute the fairly steady, opti- 
mistic and on the whole confident prognosis of the progress 
of macrophysics. 


4. Microphysics 


To begin with: What is microphysics? Without some sort of 
convention or definition the question cannot be answered. 
The demarcation line between macro- and microphysics is 
not a stone wall, this category is historically conditioned. At 
one time, even as recently as half century ago, microphysics 
could include all of atomic physics. We have already men- 
tioned that to-day not only atomic but even nuclear physics 
rather belongs to macrophysics. Without reservations we 
now refer to microphysics high-energy physics, the issue of 
the structure of matter at the ‘fundamental’ level (quarks, 
gluons, leptons). On the whole, the spectrum of questions 
involved is rather obvious and, specifically, it is covered 
by the list given at the beginning of the article. 

Space constraints do not allow us to deal at length with 
microphysical issues (problems 12-16) even as briefly as it 
has been made above in relation to macrophysics. But short- 
age of space is one thing, the contents of microphysics is 
another. Here we deal with the most fundamental issues of 
physics and all natural sciences. These days see develop- 
ments of exceptional significance in microphysics. Suffice 
it to say that in microphysics the transition is in progress, 
or even completed, to another level of ‘fragmentation’ of 
matter—instead of nucleons (or baryons in general) and me- 
sons (or in other terms, strongly interacting particles—had- 
rons) this science is now dealing with quarks and gluons as 
the tiniest (fundamental or primary) particles. This, of 
course, is a step of enormous significance, similar to the tran- 
sition from indivisible atoms to atoms consisting of the nu- 
cleus and the electrons. What is more, the various inter- 
actions combine into some unified interaction (in particular, 
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the electromagnetic and weak interactions have already 


united, especially after the discovery in 1983 of W*- and 
Z°-bosons). By and large, during the last decade (although 
much had been anticipated earlier) the image of microphys- 
ics has changed enormously. Something of the sort, as I 
see it, took place in our century only in the years from 1925 
to 1932, when quantum mechanics was constructed and the 
neutron and positron were discovered. Such relatively short 
spells of revolutionary changes occur rarely, therefore we 
referred to them above as to events of exceptional impor- 
tance. 

In such situations, during turmoil and revolution, it is 
especially difficult to come up with some predictions. It is 
quite probable that by the end of the century a lull will 
come, although it is absolutely beyond doubt that far from all 
problems will be solved and new problems and difficulties 
will make their appearance. We will confine ourselves to 
listing some of the already posed questions, answers to 
which, it is hoped, will be obtained in this or next decade. 

Proton decay (one of decay channels being p — n° + et). 
If the mean lifetime of the proton is less than 10°? years, it 
will be possible to detect the decay in the foreseeable future. 
Evidence for instability of the proton would be a discovery 
of fundamental proportions. 

Nonzero rest mass of neutrino (primarily of electron neutri- 
no ve). If the rest mass m, or the rest masses My, and 


my, _ of other kinds of neutrino are higher than 1 eV, and es- 


pecially so if they are larger than 10 eV, this would be of 
colossal cosmological significance. 

Quarks of various flavors, and gluons. On the theoretical 
side, persistent (or, one can even say, vehement) analysis 
is under way on a large number of options of ‘grand unifica- 
tion’ (the unification of the electromagnetic, weak and 
strong interactions), of superunification (that includes, in 
addition to those mentioned above, gravitation), of quantum 
chromodynamics (the theory of strong interaction). Natu- 
rally enough, the situation is far from clear, but it has been 
a long time since theoreticians have been offered so many 
glowing prospects. 

Although quarks have not been ‘mastered’ completely 
yet, the physical community is already widely discussing 
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the issue of protoquarks (preons, etc.), i.e. component parts 
out of which quarks are made up. This problem is of funda- 
mental and profound value. It is hard to believe that 
‘fragmentation’ of matter can go on indefinitely. In all 
probability, there exist some ‘tiniest blocks’ of the Univer- 
se. Two decades ago such blocks were thought to be nucleons 
(baryons), mesons, photons, and leptons. To-day the place 
of hadrons as fundamental particles has been taken by 
quarks. (Now quarks of five flavors are known, but only few 
doubt that there exists another one, the sixth quark; each 
quark can be in one of three states, called colors. Moreover, 
corresponding to each quark there is an antiquark.) The 
issue of protoquarks, as mentioned above, consistsin attempt- 
ing to answer the question: whether or not quarks themsel- 
ves are compound particles, composed of yet more elementa- 
ry particles, protoquarks. Up until now, to the best of my 
knowledge, the question should be considered absolutely 
unclear. But it would be well to reach at last the ‘rock bot- 
tom’ and to stop, perhaps not at quarks, then at the next 
stage. 

Unlike macrophysics, clearly, no ‘quiet’ prognosis seems 
to be possible here even for the next 15 years. But this does 
not mean, I think, that things here are in a state of total 
uncertainty. On the contrary, theoreticians (and now we 
have an army of them, so that they just ‘pounce’ on any 
promising or simply popular area) have produced agood deal 
of options [8, 9] or, to use a modish word, ‘scenarios’. 
Options are sorted out by analysis of experimental evidence 
(the only limitation here is the rate and feasibility of ob- 
taining the experimental evidence). Therefore, the progress 
rather resembles not wandering in dark woods but frantic 
rushing from one brightly lit gamble house to another. May 
I be forgiven for such vulgar comparisons in matters that 
involve the highest strain of the human intellect. But as the 
saying goes, there is but one step from the sublime to the 
ridiculous, and the loftiest elevations of thought may be 
encountered side by side with the prose of life. It is beyond 
me to express the feelings that possess me when I observe 
the life of science not as an outsider but asan active partic- 
ipant, being in everyday contact with concrete people, 
their weaknesses, strengths, passions, joys, and sorrows... . 

But back to microphysics. Į want to stress that in this 
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science much ‘homework’ has been done. A vast breakthrough 
has been accomplished; perhaps 15 years would be just 
about enough for the situation to clarify to such an extent 
that a new stage will come about, the one that will require 
novel ideas. 

Against the background I have just unfolded, the reader 
will perhaps be even surprised by the comment I want to 
make (or rather to repeat—it was also made in my papers 
and a book [1-3]). Namely, I believe that the role of micro- 
physics in natural sciences and the life of the human socie- 
ty now became less significant than it was in the first half 
of the century. This comment seems trivial to me. But the 
fact is that many disagree with it andso I have been severe- 
ly criticized. Therefore, I have (and perhaps this will be 
not without interest for the reader) to expand on this thesis. 

Interest in science, pursuit of science and devotion to 
science are in need of no justification. Science is a calling 
for many, the essence of their lives, just as for others are 
arts, music, and so forth. But they normally do not ask a 
musician why he plays a musical instrument, why he de- 
voted his life to music. In contrast, they often ask a physicist 
why he works, what use will humanity have from his work 
for industries, agriculture, and other spheres of human en- 
deavor. To be sure, I thoroughly agree that one’s work 
should be of use, when it is possible. But the considerations of 
immediate use should not and cannot appear as the main in- 
centive of scientific activities. As for microphysics, any work 
here is totally justified, exciting and fascinating regardless 
of whether or not any practical applications are in the offing. 
In that context, microphysics has been, is and will be at 
the frontiers of physics and natural sciences in general. Con- 
crete tasks vary, of course, but the spirit, atmosphere and 
even style of scientific research in microphysics remain sub- 
stantially unchanged. I always remember the words of 
Einstein, who characterized this atmosphere as ‘the long 
years of search in darkness, full of presentiments, strained 
expectations, alternating hopes and exhaustion’. 

But each medal has two sides. So does science, I think; it 
is not isolated from society, it is closely linked to it. There- 
fore, quite naturally, social repercussions, status and signifi- 
cance for society of some scientific areas or others are con- 
ditioned not only by some intrascientific considerations and 
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the logic of a given scientific discipline, but also by its ex- 
ternal influences. And it is in this context that I think that 
throughout the first half of the current century microphysics 
had been at the forefront of natural sciences. This was a re- 
sult both of the very contents of microphysics and of its 
influence on other disciplines and all society. Really, the 
objects of microphysics were the atom and then the atomic 
nucleus. It was exceedingly difficult to understand the be- 
havior of atoms and electrons in atoms; this required a real 
revolution in science—the creation of quantum mechanics. 
In other words, the microphysics of that time made it pos- 
sible to provide an understanding of the structure (and as a 
consequence to control it in many respects) of matter at a 
level that we all encounter in our everyday life. Hence the 
enormous impact of the results achieved on all natural 
sciences (chemistry, biology, etc.), on technology, on medi- 
cine, and on the entire fabric of life and the development of 
the human society, including, unfortunately, the threat of 
application of mass-destruction weapons. 

To-day’s microphysics—with its quarks, with particles 
that only live a tiniest fraction of a second, and with all-pen- 
etrating neutrinos—has passed through the phase that can 
be compared to ‘warm seas’ and has entered ‘beautiful but 
cruel cold waters’. To cut a long story short, the role played 
now by microphysics is, I believe, akin to that of astrophys- 
ics (save for the exploration of the Solar system). And what 
a remarkable role it is! This appraisal, it seems, is supported, 
first, by an inventory of the literature: the share of micro- 
physical papers has dropped markedly as compared with 
earlier times (of course, considering the changes in the con- 
tents of microphysics, discussed earlier in the article). Sec- 
ond, in our ever changing world one should not be amazed 
at changes in the relative position of even the most impor- 
tant scientific domains in the life of the human society. So 
we now witness the spectacular growth of biology, notably 
molecular biology. Biology now came to the fore and oc- 
cupies the place that in the first half of the century was oc- 
cupied by microphysics. And again we here have a fusion— 
the advances of genetics and other biological sciences, strik- 
ingly interesting in their own right (purely scientifically), 
are making significant contributions to medicine, agricul- 
ture, and so on, ° 
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It might seem that I am knocking at an open door here, 
put this is not so in the light of the criticism I have men- 
tioned above. Criticssay: Who knows that microphysics will 
not in the end yield new great discoveries of practical na- 
ture, similar to the mastery of atomic (nuclear) energy? 
And after all, was nuclear physics not looked upon, just bare 
50 or 45 years ago, as having no practical value whatsoever 
in the foreseeable future? Of course, nobody would 
dare to maintain that microphysics will never come into 
the limelight of society (in the sense that is clear from the 
foregoing). But this is not a ‘must’ and, above all, this does 
not make the evaluation of to-day’s microphysics invalid. 

I think that criticism and irritation engendered by the 
pronouncements about the changed status of microphysics 
come from a lack of understanding of the role and place of 
microphysics within the structure of physics itself. But this 
has already been mentioned at the end of the previous sec- 
tion. Yes, microphysics is a foundation (just like cosmolo- 
gy in astronomy). But the edifice of science, like any edifice 
for that matter, is more than just the foundation. 

There are also no good reasons for regarding the builders 
of foundations as high priests, or some special caste, or 
élite, and for treating other builders of the structure as sec- 
ond-rate craftsmen. But I am not going to pursue any fur- 
ther these clear arguments; they were evoked by sporadic 
frictions or misunderstanding among physicists and astro- 
nomers of various specialities. 


5. Astrophysics 


Astronomy has always been intimately linked to physics, 
the degree of intimacy varying widely with time, however. 
The development of classical mechanics, accomplished in 
general by Newton, drew heavily on the astronomical re- 
sults of Copernicus, Galileo, and Kepler. The telescope, in- 
troduced by Galileo into astronomy, would have been impos- 
sible without the achievements of optics. But with time 
the separation between physics and astronomy became more 
clear-cut and deep. This was so even 40 years ago and mani- 
fested itself in many things. For example, at the time astron- 
omy was taught at the Mechanical and Mathematical De- 
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partment of Moscow University but it is taught now at the 
Physical Department. This example is perhapsnotso strik- 
ing. What is more important these days isthatmany scien- 
tists can only arbitrarily be classed as physicists or astrono- 
mers, and astrophysics became a predominant branch in 
astronomy. The latter circumstance is associated with the 
nearly accomplished process of turning of astronomy from 
optical into all-wave one. Involvement of new ranges (ra- 
dio, submillimetre, X-ray and gamma-ray) brought in new 
people, mainly physicists and radio engineers. On the other 
hand, new remarkable results and astronomical discoveries, 
to be interpreted, called for wide employment of theoreti- 
cal physics and, in general, attracted the attention of phys- 
icists (like me) who became interested in astrophysics. 
My first astronomical paper, on the Sun’s radio emission, ap- 
peared in 1946. Moreover, as it has already been mentioned, 
now it is in general difficult to draw a boundary between 
physics and astrophysics, and so the division is fairly arbi- 
trary. So, the general theory of relativity (GTR) is primari- 
ly physics. But nearly everything related to GTR is acted 
out, so to speak, against the cosmic (astronomical) back- 
ground. Therefore, I bring issues connected with GTR under 
the heading of ‘astrophysics’ in the list of the key problems. 

Let us now turn to some of the key astrophysical problems. 

Experimental verification and applicability of the general 
theory of relativity. In our days fundamental physical theo- 
ries and predictions are, as a rule, checked rather quickly. 
A striking example is the nonrelativistic quantum mecha- 
nics. Although some disputes about its completeness are 
lingering on (about a possibility to get rid of probabilis- 
tic predictions), nobody doubts the validity of the funda- 
mental results of quantum mechanics any more. In this re- 
spect, GTR isanexception: it is ten years older than quantum 
mechanics, but its verification still remains a topical prob- 
lem included in our list. 

The reason for this is absolutely clear. GTR is a definite 
theory of the gravitational field worked out by Einstein 
from the equivalence principle and describing the gravita- 
tional field only using the metric tensor g;, (z'), which de- 
fines the squared interval ds? = g;,dz‘dz® (i, k = 0, 1, 2, 
3; 2° = ct is the time coordinate; c = 3 x 10° cm/s is the 
velocity of light). In the inertial (Galilean) reference system 
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e it exists) wath the Cartesian coordinates we have go) = 1, 
= Loo = E33 = —1, in = O for ik. In a weak grav- 
ine field (iu e. when the components g;, can be sup- 
posed to becloseto the above-mentioned Galilean values) we 
can put go, = 1 + 2plc?, gr = — 1 + 2q/c?, where ọp is 
the Newtonian gravitational potential (we assume here 
that ‘at infinity’ p = 0 and, when we write g,,, we mean 
a spherically symmetrical problem, e.g. the field of an 
isolated sphere). As noted above, the condition for the 
gravitational field to be weak is | 9 |/c?<1. 
On the Sun’s surface 





Itol __ Mo _ Te 9.49 x 10-8 
c? ro 2rg ` 4 
2GM, 
Te@ = — = 2.94 x 10° cm, 





since the solar mass is oe 2 x 1033 g, the radius of the 
photosphere is rọ = 6.96 x 101% cm, and the gravitational 
constant is G = 6.67 x 10-8 cm?- gol. -sl, The quantity 
Tg introduced above is called the gravitational radius of the 
mass M, rg = 2GM/c? ~ 3 x 10®°(M/Mo@) cm. For the 
Earth rea ='0. 86 cm since the Earth’s mass Ms = 5.98 X 
107 g and on the Earth’s surface | p g|/c2? = 7 x 10-10 
(the Earth’s radius is rg = 6.37 x 10° cm). 

For a circular planetary orbit |  |/c? = v?/c?, where v 
is the velocity of the planet; for the Earth’s orbit | @ |/c? x 
10-8, since vá = 3 X 10° cm-s-!. The gravitational field 
within the Solar system can thus 7 said to be not just weak 
but very weak: first corrections to the Newtonian theory of 
gravitation being no more than 2| 99 |/c? ~ 4 x 10°. One 
of the effects predicted by GTR is the deflection of light, ra- 
dio waves, etc., inthe vicinity of the Sun; it is defined by 
the angle a = 4GMo/c?R = 2rgo/R = 1.75re/R ang. s, 
where R is the impact parameter, or the closest distance be- 
tween the light ray and the Sun’s centre (at an angle of 2” 
a tall man is seen from within 200 km). Two other critical, 
as they are sometimes called, effects of GTR predicted by 
Einstein—the gravitational frequency shift and the preces- 
sion of planetary perihelia (the effect is at maximum for 
Mercury) are also of the order of | ọ |/c?; the same applies 
to the relativistic lag of signals in planetary radar observa- 
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tions—this effect of GTR is genetically related to the de- 
flection of light in the Sun’s field (of rather the deflection of 
light and the lag of signals—sort of two sides of the same 
medal). 

For years, even as recently as the previous decade, the 
question of experimental proof of GTR reduced to observa- 
tions of the above effects, and also to the check with the 
maximal accuracy of the equality of the inert and heavy 
masses. Itcan besaid that all the predictions of GTR have 
been borne out and there are no indications as to the pres- 
ence of any contradictions to GTR. But the accuracy is not 
impressive—theory agrees with experiment within an accu- 
racy of one percent or sometimes of 0.1-0.01%. Some pro- 
jects are known that have it as their aim to refine these data. 
There are hardly any objections here, but it seems to me 
that for weak fields GTR can safely be regarded as verified 
with sufficient confidence. But this by no means suggests 
that GTR is automatically valid for a strong field, and so 
verification of GTR in strong fields appear to be a pressing 
problem. 

A word of explanation is in order here (see [10, 11]). Note 
that in spherical coordinates for a sphere of mass M 
we have —g,, = 1/1 —rg/r) = 4 + 7rg/r + (relr +... . 
If r/r = 2GM/c*r = 2| ọ |/e? <1, which exactly answers 
to the weak field approximation (see above), we can put 
—g,, = 1/1 — rglr) = 4 + rg/r. It is in this sense that 
GTR can at present be considered verified; but it is abso- 
lutely clear that this is a far cry from a real verification of 
GTR in a strong field, where rg/r = 1, because in a strong 
field (at z = 1), of course, 1/(4 — x) #1 + x. Those theo- 
ries of the gravitational field that try to compete with GTR 
are now constructed precisely in this way: so that in the ap- 
proximation of the weak field to coincide with GTR and 
yield the expression g,, = —1 — 2GM/c?r. 

To verify GTR in a strong field is a challenging task. 
Within the confines of the Solar system the terms of the or- 
der (p/c?)? are already as a minimum 5-6 orders of magnitude 
smaller than the terms of the order | p|/c?. On the surface of 
neutron stars | @, |/c? = 0.1-0.3 (e.g. for M, = Mo and 
ra = 10 km we have | q, |/c? = 0.15) and there is a possi- 
bility to verify GTR in a not-too-weak field. The same is 
true of binary pulsars. If we leave alone cosmology, howey- 
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er, the issue of this verification is closely linked with the 
problem of black holes. Near the event horizon, to which in 
the spherical coordinates used there corresponds r— rg, 
the gravitational field is, of course, strong. GTR does sug- 
gest a possibility of the existence of black holes. Therefore, 
even the very fact of discovery of black holes would be a 
compelling evidence for GTR, and later on we could hope 
that further observations would provide quantitative checks 
of GTR. At the same time, as is so often in science, the op- 
posite is not true: if no black holes are discovered, this will 
in no way refute GTR. The point is that formation of black 
holes in a collapse of stars or a gas cloud is an exceedingly 
complicated process. Collapse is opposed by the need to 
give away angular momentum, possible fragmentation, and 
nuclear explosions. Therefore, during a period of about 10 
billion years or even less, an age of a number of stars, 
formation of black holes in the observed expanding Uni- 
verse may be a highly unlikely event. 

Anyway, the detection of black holes is one of the most im- 
portant and fascinating problems. I think that this problem 
(or rather the entire issue of black holes) is second in impor- 
tance in to-day’s astronomy, the first one, with good reason, 
is cosmological problem. But it is for this reason that in this 
short article I decided not to touch upon the essence of cos- 
mology and the physics of black holes, just as above I did 
not touch upon the essence of microphysical problems. Any 
other approach would mean to dwarf these all really pro- 
found, great problems of this day.! 

As regards the theory of gravitation I will only confine 
myself to several comments. For a star of radius close to the 
gravitational one and mass M ~ 10M, = 2 x 10° g its 


average density p is of the order of the nuclear density 
Pn 73 x 10% g-cm~-* for which the equation of state of mat- 
ter is in general known sufficiently well. By the same token, 
at densities even several orders of magnitude higher than 
the nuclear one, there are no grounds for expecting so radi- 
cal a change in the equation of state, which might stop the 


1 For more information and bibliography on these problems see 
the books and papers referred to above. There are also a good many 
eas pay accessible books and review articles on those topics (see 
e.g. ). 
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collapse. In this connection, for macroscopic black holes (as 
one can see for M > 107° g~ 10-2M+) any constraints of 
microphysical nature are out of place here and the question 
of applicability of GTR in strong fields is not connected with 
quantum effects or inapplicability of the existing space and 
time concepts ‘in the small’. In fact, suppose that the appli- 
cability of GTR is curbed by quantum effects—the emer- 
gence of quantum fluctuationsof the metric g;,. Then, as is 
well known, GTR is unsuitable only for length and time sca- 
les Z and t, or density p, comparable with Plank’s values 


ly = VRGI® = 1.6 x 103 cm, 
tg = lole = 5.4 x 40744 S, 
Pg = hlelg = c5/hG? = 5.2 x 10% g-cm-? 


(here A = 1.05 x 10-7 erg-s is the quantum constant). 
Corresponding to pg and lę is the mass M, = plg = 
VhelG = hlel, = 2.2 x 10> g. 

And so for lœ lg, t tge, P Kpg M>M, quantum 
effects are no hindrance to applications of GTR—of 
quite a definite classical theory of the gravitational 
field.” If l < lg, t < tg, and so on, quantum effects will be, 
generally speaking, large and GTR will be unsuitable. De- 
velopment of the quantum theory of gravitation—which 
normally involves attempts at quantization of GTR (not 
some other classical theory of gravitation)—is a problem of 
enormous proportions and difficulty. It has not been 
solved completely yet. It follows that the quantum theory of 
gravitation is required for examination of the region near 
the classical singularity (9 — oo) that appears in cosmologi- 
cal solutions and in gravitational collapse. In all probability 
in the quantum theory of gravitation no singularities must 
appear. 

But one should not link the question of verifying GTR 
in strong fields with the account of quantum effects. Near 
the gravitational radius (event horizon) of a massive black 


2 The foregoing does not mean to imply, of course, that quantum 
effects can be ignored completely. So, the black body radiation (evapor- 
ation) of black holes is one of the most important discoveries of physics 
in the last decade (Hawking, 1974), and it can occur also at M > Mg, 
but the gravitational field of a black hole may be regarded as classical. 
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hole quantum corrections are negligible and, if GTR holds, 
definite predictions can be made—for instance, for a nonrotat- 
ing stationary black hole the well-known Schwarzschild 
metrics must be valid [10, 14]. In addition to correlations 
of theory and observations for strong fields, also of importance 
is purely theoretical analysis of the theories of gravita- 
tion that attempt to compete with GTR. There are such theo- 
ries, but they are normally rife with difficulties. For exam- 
ple, they (or at least some of them) contain solutions with neg- 
ative energy, which leads specifically to the following result: 
the orbit of a binary star does not contract under the action 
of gravitational radiation (i.e. the system loses energy), but 
on the contrary it ‘uncoils’ (i.e. the system receives energy) 
[13, 14]. Clearly, such theories are unsuitable and can be dis- 
carded. I know of only one theory of gravitational field 
that generalizes GTR and incorporates it as a special, or 
rather limiting, case. I mean the theory that introduces, in 
addition to the tensor g;,, some scalar field. By and large, 
the problem of analysis of ‘viable’ theories of gravitational 
field is quite a complicated one and calls for a special exa- 
mination. 

In connection with GTR verification the following is 
worth noting. The depth, beauty and simplicity (of course, 
simplicity in some higher sense) of GTR, its proved validity 
in weak fields, and many years of analysis—this all makes it 
of little interest further to verify GTR in the classical re- 
gion, i.e. not near ‘true’ singularities (at event horizon there 
are no such ‘true’ singularities). I have always belonged to 
enthusiastic proponents of GTR. As far as intuition and faith 
are concerned, it seems to me that GTR is absolutely valid 
in the classical region, i.e. its boundaries are determined by 
quantum effects or the possibility of the existence of the fun- 
damental microscopic length l; that is longer than the gra- 
vitational length 1, = 10-*8cm. But physics does not, just 
cannot, rest only on intuition and faith, the supreme judge 
is still experiment and observation. On the theoretical side 
too, some of the possible generalizations of GTR (e.g. those 
involving introduction of a new scalar field or a related as- 
sumption that the gravitational constant G varies with 
time) do not appear as preposterous as all that. Be that as it 
may, I believe that GTR verification in strong fields, the 
question of the existence of black holes, the question of the 
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applicability of GTR are all key problems (or, if you want, 
one general key problem) of present-day physics and astro- 
physics. 

The last comment: the quantum constraints on the appli- 
cability of GTR, which are characterized by Plank’s para- 
meters (see above), follow from the assumption that the ex- 
isting space and time concepts are in general valid down to 
the scales lg and tg. But experience, from high-energy phys- 
ics and quantum electrodynamics, suggests that ‘everything 
is OK’ only for l > 10-1 cm. Physics has not yet ‘penetrat- 
ed’ into the realm of smaller scales. But there is a ‘gap’ of 
all of 17 orders of magnitude between l ~ 10-"° and lg x 
10-85 cm. Therefore, it is hardly possible to view as de- 
void of sense and grounds the decades-old question: whether 
or not there is some fundamental length l}, such that for 
l < i, space already becomes, ‘unusual’, i.e. grained, quan- 
tized, and so forth. 

The concept of fundamental length and, specifically, length 
l, ~ 10-17 cm has been receiving support for years now in 
connection with the divergent expressions in the quantum 
theory of fields. But as regards the fundamental length, all 
the ‘divergences’, it appears, must be ‘truncated’ at distan- 
ces (for wavelengths) of about lẹ. It was such considerations 
that as applied to weak interactions lead to lp ~ 10-1” cm 
(Ex hell, 1 erg ~ 10! eV). But now, after the unified 
theory of electroweak interaction has been created, these 
arguments have been discarded. As a result, it is only rarely 
now that the possibility of the existence of a fundamental 
length is remembered and the theory is confidently extrapo- 
lated down to l > lg ~ 10-33 cm. Well, audacity in physics 
has many times proved to be justified. But this in no way 
removes the question of the fundamental length, we should 
remember this possibility. If there exists some fundamental 
length 1, œ lg, then it is the scale l,, not lg, that will, it 
seems, be the limit of the scope of GTR and the whole ex- 
isting theory. For considerations of dimensions correspond- 
ing to lẹ is the density pr = hlelf» 5 x 10° x (1.6 x 
10-83/1,)* g- cm’. If l ~ 10-1’, then pp = 10° g cm~. 
We see thus that even, say, at lp œ 10-7% cm pp = 10% 
g-cm-?, i.e. it is 52 orders of magnitude smaller than 
Pg. It follows that any suppositions concerning lp are not 
innocent. One should not forget about such a possibility, 
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because we know practically nothing from experiment 
about the scales l< 10-1 cm. 

It is clear from the above treatment, however short and 
fragmentary, that the issue of the singularity, and hence 
cosmological problem in the first place, is closely related 
to microphysics—a remarkable illustration of the profound 
unity and links of physics and astronomy. What is more, 
studies of the ‘early Universe’, i.e. the earliest (close to the 
singularity) phases of the evolution, in cosmological models 
with a singularity (we deal here with time intervals sepa- 
rated from the classical singularity by tg ~ 10-* s or by 
times only severalorders of magnitude longer), appear to be 
nearly the only way of somehow correlating some microphy- 
sical theories with observations [12]. 

In this subsection relatively much space is devoted to 
problem 17 (experimental verification and applicability of 
GTR). It would be a good idea perhaps at least in one in- 
stance not to confine ourselves to general declarations but to 
take a closer look at the problem at hand. Later in the arti- 
cle this will be impossible, unfortunately. And as for prog- 
nosis, on the one hand, nearly 70 years have proved insuf- 
ficient for GTR to be verified completely. But on the oth- 
er hand, the time scale in this case, as in many others, is 
far from linear. GTR has recently become of wide interest, 
and search for black holes is in the focus of attention (see 
below). It is quite probable, therefore, that even before the 
close of this century significant progress will have been 
achieved with GTR verification in strong fields. 

Gravitational waves. Einstein completed the construction 
of the general theory of relativity in 1915 (both the deriva- 
tion of an equation for the gravitational field—the metric 
tensor g;,—and the explanation of the precession of the pe- 
rihelion of Mercury, and the bending of light by the Sun). 
Already in the next year, 1916, Einstein addressed himself 
to gravitational waves and in 1918 he obtained the now well- 
known formula relating the energy emitted in the form of 
gravitational waves to the square of the third time deriva- 
tive of the quadrupole moment of the mass of a nonrelati- 
vistic emitting system, e.g. a binary star. 

Gravitational wave existence appears to be especially 
probable if one considers that general relativity is a general- 
ization of the Newtonian theory of gravitation, analogous 
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to the transition from electrostatics to electrodynamics. 
The velocity of gravitational waves, like that of electromag- 
netic ones, is the velocity of light in a vacuum. Under the 
influence of gravitational waves solid bodies deform and 
vibrate, and in a system of free particles (e.g. two or more 
satellites) separations between them change. 

Among the known interaction types gravitation is the 
weakest one. Under terrestrial conditions we are exposed to 
strong gravitation only because gravitational forces are 
long-range ones, and so the entire Earth acts on us. In the 
case of, say, two protons, their gravitational attraction is 
e?/Gm;,  10°° times weaker than the Coulomb repulsion 
(proton “charge is e = 4.8 x 10-1° CGS units and proton 
mass is mp = 1.67 x 10-* g). And so the luminosity (pow- 
er) of gravitational emission Lg is also relatively small. 
For instance, for a binary star with the component masses 
M,~ M,~ Mo = 2 xX 10°® g and the orbital radius equal 
to that of the Earth’s orbit (r = 1 a.u. = 1.5 x 10% cm), 
Lg = 10! erg - s-t, whereas the total een) 
luminosity of the Sun is Lg = 3.86 x 10% erg - 

The weakness of gravitational interaction, and es of 
gravitational waves, makes their detection a difficult prob- 
lem. They have not yet been detected. Even for a stellar col- 
lapse, supernova explosion or neutron star collision one can 
hardly expect a gravitational radiation with a total energy 
release W, in excess of 10% erg (~Mo¢?). A more likely val- 
ue is W, ~ 10° erg, and then for R ~ 3 Mpc = 10” cm 
the integral flux of gravitational waves on Earth will be 


Py ~ W,/4nR? ~ 10 erg - cm~? (in a volume with radius 
Rz > 3 Mpc there are about 300 galaxies, and one can ex- 
pect 1-10 supernova explosions a year). An oncoming grav- 
itational wave (or impulse of gravitational waves) changes 
the distance l between ‘test masses’, which serve as a detec- 
tor, by Al = hl, where h characterizes the integrated flux 


F, for the gravitational signal and its duration is 1, (it ap- 
pears that [15] h~ V GF yxgle. ) The quantity h is generally 


negligible. For instance, in the example cited, where F, = 
10 erg-cm~*, for the gravitational signal with duration 
Tg ~ 10° s the characteristic dimensionless amplitude of 
the gravitational wave is k œ 10-*°-10-21. But the most op- 
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timistic estimates (at Fg ~ 10° erg - cm”? and Tg œ 10°? s) 
yield h x 10”. 

Available first-generation gravitational wave antennas 
cannot detect such small impulses, for their sensitivity is 
insufficient. Second-generation antennas now under construc- 
tion (specifically, sapphire ones) are designed to measure 
the already achieved level of Al ~ 10-7” cm. Hence at the 
antenna length Z ~ 100 cm we get already h = Alll ~ 10-7". 
Therefore, under favorable conditions, when no less than 
one event withh > 10-!° will reach the Earth, gravitational 
waves will be detected in the near future. It is quite proba- 
ble, however, that for real explosions h ~ 10-2°10-?!, and 
then we will have to wait rather long to detect them (this, 
perhaps, will take place already in this century). 

The detection of gravitational waves of cosmic origin, and 
thereby the birth of gravitational-wave astronomy, is pri- 
marily of astronomical interest. As a matter of fact, this 
would provide one more channel of astronomical informa- 
tion. What is important, of course, is that this information is 
not just another line of evidence obtained from other 
sources. Specifically, the impulse of gravitational waves char- 
acterizes the changes in the quadrupole moment of a star, a bi- 
nary system, and so on. At the same time, the issue of grav- 
itational waves is closely linked with GTR tests. Theories 
of the gravitational field, other than GTR, state that gra- 
vitational waves are, generally speaking, emitted in a dif- 
ferent way than as described by GTR. Besides, these waves 
may turn out to be not purely transverse as in GTR. 

It is significant for the time being, while gravitational 
waves have not yet been detected, that the GTR formula for 
the power of gravitational radiation can be tested, so to speak, 
in an indirect manner. So, observations of the changes in 
the elements of the orbit of the binary pulsar PSR 1913 + 
16, carried out during the last years, attest the validity 
of the GTR formula, and hence of GTR itself [13, 14]. As 
a result of the emission of gravitational waves the binary 
system naturally loses energy and the stars (one of them is a 
radio pulsar) come closer together, and the period of their 
orbital motion decreases (in particular, for PSR 1913 + 16 
the annual reduction in the orbital period is 7.6 x 10-5 s). 
Observational evidence agrees with the assumption of the 
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energy loss in accordance with GTR and disagrees with the 
predictions of some other theories of the gravitational field. 
True, as is usual in such cases, observing one object is yet 
insufficient to warrant any confident judgement (one can al- 
ways assume that the agreement of theory with observations 
for one object is just a coincidence). Anyway, the results ob- 
tained for PSR 1913 + 16 go to support the validity of 
GTR, at least with reference to the analysis of a number of 
questions concerned with gravitational-wave astronomy. 
And so with ever more confidence one can now draw on the 
GTR results when designing detectors of gravitational 
waves, estimating their power, and so forth. 

There is no doubt that the issue of gravitational waves and 
gravitational-wave astronomy will remain among the key 
problems of astronomy well into the 21st century. 

Cosmological problem. Relation between cosmology and 
high-energy physics. Cosmological problem, viewed in a rath- 
er wide sense, is a most important one in astronomy. Its 
place in astronomy is in a way similar (although not identi- 
cal) to that of microphysics in physics. This has already been 
mentioned above; several comments have been also made 
concerning cosmology and its links with microphysics. Theo- 
retical studies in this field are now extensive and intensive. 
But whatever the strides of astronomy, to get observations 
of use to cosmology will be very difficult. As a rule, such ob- 
servations require many years of work. An example is the 
2.4-m optical telescope now under construction, which will 
operate in space. It will be commissioned in several years 
and is supposed to remain the largest device of this type to 
the end of the century. This project, in all probability, is 
going to yield much cosmological evidence. But for the rea- 
sons mentioned above I will not dwell on cosmology any 
longer here. 

Neutron stars and pulsars. Physics of ‘black holes’. In this 
special subsection we are not going to touch upon this to- 
pic. We have dealt with neutron stars, pulsars and black 
holes elsewhere in the text (for more details see [2, 3] and 
the references given there). 

Quasars and galactic nuclei. Formation of galaxies. Pulsars 
were discovered in 1967 (the first publication was made ear- 
ly in 1968, however), and their nature became clear less than 
two years later. It was found that pulsars are rotating mag- 
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netic neutron stars. The fast rotation (one of the fastest of 
the known pulsars, PSR 0531 in Crab has the period T = 
0.033 s) and the strong magnetic field (H = 10% Oe) 
of a neutron star are quite understandable (or rather explain- 
able) from the simplest considerations involving the con- 
servation of angular momentum and magnetic flux when a 
‘conventional’ star turns into a neutron star (this has been 
mentioned above in connection with problem 7). Inferences 
to this effect had been made in the literature even before the 
discovery of pulsars. For one thing, I do not want to belit- 
tle the role of theoretical work; for the other thing, I do not 
want to make a secret of the fact that my own attempts at 
theoretical treatments of pulsars’ radiation were not espe- 
cially successful (although the problem has not yet been 
solved completely [16]). I just want to say the following. If 
immediately after the first pulsars astronomers had not dis- 
covered fast-periodic pulsars in Crab and Vela, which 
could only be associated with the model of rotating neutron 
stars, disputes about the nature of pulsars would have gone 
on for years (a rival of the neutron star model would have 
been the white dwarf model). 

I have made this digression about pulsars because the out- 
come of quasar studies is absolutely different. There had been 
some earlier indications hut by measurements of red shift 
in the spectrum of the quasar 3C273 quasars were dis- 
covered in 1963, i.e. several years earlier than pulsars. But 
even now the nature of quasars, or rather of their central 
regions, remains unclear. This does not mean, of course, 
that during the 20 years that have elapsed since the discov- 
ery of quasars little has been done. On the contrary, much 
work is under way but it so happens that no key has been 
found to the ‘black box’, as the central region of a quasar or 
of the active galactic nucleus is sometimes referred to. 

Before explaining the meaning of that term, a few words 
about what can be regarded as established. 

Quasars are located at cosmological distances. This im- 
plies that the red shift of the lines observed in the spectra 
of quasars is of cosmological origin, i.e. is due to quasars re- 


3 It is not unlikely, however, that there exist long-period pulsars, 
white dwarfs in their nature, but this does not change the situation 
in general. 
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ceding from us as the Universe expands. This issue caused 
much dispute, but I (as most astronomers) have never seen 
any grounds for questioning the cosmological nature of the 
red shift in quasars’ spectra, and now, it seems, there is 
consensus in this respect.* Furthermore, quasars in the initi- 
al sense of the concept (quasistellar radio sources— QSR) 
appeared to be just a relatively small part of the more gen- 
eral class of quasistellar objects (QSO), the one of quasistel- 
lar sources. Moreover, it appeared that quasars and quasistel- 
lar objects are the nuclei of gigantic galaxies, i.e. they are 
surrounded with stars. Similar active nuclei are also present 
in many ‘conventional’ galaxies (so, the famous Seyfert 
galaxies are giant spiral galaxies with active nuclei). In 
what follows we, for short, lump all of these objects togeth- 
er and speak just about quasars, i.e. compact (radius 
r ~~ 1016-40!” cm) extremely bright sources—their total lu- 
minosity may be as high as 10% erg - s! (our Galaxy’s 
luminosity is La ~ 1044 erg - s7!). 

Thus, it appears, that barring some exceptions, we deal 
with the same class of objects: massive compact nuclei in 
the central regions of galaxies. The trend to form such nuclei 
is quite clear, since in the centre of galaxy or a gas cloud 
(protogalaxy) there is a deep potential gravitational well. 

No doubt, energy release in quasars involves some grav- 
itational energy. The total energy release of quasars seems 
to be as high as 1061-106? erg (luminosity up to 10% erg - s~? 
during 3 x 105-3 x 10° years). Even for a black hole, when 
energy release can be a sizeable proportion of the rest mass 
Mc?, the mass of the nucleus must be 10°Mo. In general, the 
mass M for quasars and active nuclei undoubtedly reaches 
(108-10°) Mo or even more (the mass of the Galaxy is Mg ~ 
4011 Mo). 

In addition to radio, IR and visible radiations, many qua- 
sars are powerful emitters of X-rays. So, in one of the lists 
of 111 quasars examined by the satellite-borne X-ray observ- 
atory Einstein (satellite HEAO B-HEAO 2 launched on 
13 November 1978) 35 quasars appeared to be sources of re- 


4 On the other hand, noncosmological red shift is also possible in 
principle, and in some individual cases (for some objects) it could 
even dominate. But this reservation changes nothing in relation to 
the overwhelming majority of quasars and active galactic nuclei. 
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corded X-rays (Ex = 0.5-4.5 keV, Ly = 108-10% erg - st). 
Quasars can, obviously, be high-power sources of y-rays, 
although hard evidence for this is only available for qua- 
sar 3C273 (for E, = 50-500 MeV the luminosity L,~ 
2 x 10% erg-s-t at Ly = 10% erg - st). 

But what does an emitting nuclei of r œ 1018-10" cm 
look like? In all probability, the emitting region itself is 
in no extraordinary conditions. But the ‘furnace’ (the energy 
source) is concealed within this region and is not seen. The 
nature of this ‘furnace’ is immaterial in the sense that it is 
of no consequence for the properties of the observed electro- 
magnetic emission. In this connection the term ‘black box’ 
is suitable. But the content of that ‘box’ still remains un- 
clear. 

The most popular models of the central region of a quasar 
and active galactic nuclei are: dense stellar cluster, magne- 
toid or spinar (magnetoplasmic rotating ‘superstar’ without 
a massive black hole at the centre), and a massive black 
hole. Combined models are also possible, of course. The stel- 
lar cluster model is the least likely, because in it, at least 
in its simple forms, individual variations of the intensity of 
quasars’ radiation are associated with explosions of indivi- 
dual stars or, say, with collisions of two stars. But this can 
hardly account for the observed marked variations of the 
luminosity. Also, the stellar cluster model is liable to the 
same objection as the magnetoid (spinar) model—it is un- 
clear why such a formation will not end up as a black hole. 
For the latter reason and for ‘general considerations’ the 
most popular now is the massive black hole model. 

As noted above, I am a strong believer in GTR (in the 
classical region, which is sufficient in this case) and I do 
not doubt that GTR admits the existence of black holes. 
Nonetheless, the model of a quasar—a massive black hole— 
appears to be not proved, not by a long shot. Even if the 
validity of GTR in strong fields is not put in question, there 
might be insufficient time for a massive black hole to form 
(as noted above). Further, there are still no direct indica- 
tions of the existence of black holes in galactic nuclei—the 
black box is still closed. Lastly, there are objections to the 
widespread existence of massive black holes in galactic nu- 
clei. The fact is that if black holes formed easily, one could 
expect that they would appear in the nuclei of currently in- 
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active (of course, we mean here the time of observation on 
Earth) galaxies, specifically in the Galaxy. It is not un- 
likely in principle that such ‘dead’ or ‘half-dead’ (inactive) 
black holes do exist. Their activity is associated with accre- 
tion of gas or the accretion of stars around the hole, and it 
might well be that there were not enough stars when neces- 
sary. Unfortunately, reliable analysis of this problem is a 
fairly difficult task. According to some authors, at the centre 
of the Galaxy there may be no black hole with M > 10°M¢. 
Others believe that at the centre there are even two black 
holes with masses of about 10°Mo each, the separation be- 
tween them being about 3 x 101” cm. The question of the 
central region of the Galaxy, or rather of the presence there 
of a black hole or a pair of black holes, is now extremely 
topical or, one can even say, burning. After all, here we 
deal with a relatively close object, and so we hope that the 
question will be solved in the not-too-distant future. 

With active galaxies and quasars it is obviously more dif- 
ficult to establish the nature of the active region (core). 
One possible approach is to observe the periodicity of varia- 
tions of quasars’ luminosity. For the magnetoid (spinar) 
model such variations, specifically those due to spinning, 
are quite natural and more stable than in the model of a 
black hole with an accretion disc. It is in principle possible 
to distinguish a black hole from a magnetoid using also the 
methods of neutrino astronomy (see below). 

When will the nature of the ‘black box’ at the centre of 
a quasar be established with assurance? Perhaps this will 
take long years, but success may come sooner, and at any 
rate for one case (example) the door of the black box will 
be ‘ajar’ and we will get a glimpse of its contents. As follows 
from the foregoing, if the nature of the nucleus of the Galaxy, 
although inactive, were clarified, it would be an outstanding 
accomplishment, especially if it were proved that at the 
galactic centre there lies a single or binary massive black 
hole. 

Since in this article we attach much importance to a defi- 
nite time limit—the year 2001—it would be quite a realistic 
estimate to maintain that by that time the nature of the 
central regions of quasars and galactic nuclei will be deter- 
mined. 

The heading of this subsection (problem 21 in the list) 
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also includes the formation of galaxies. It is in a way related 
to the nature of quasars and galactic nuclei, but on the whole 
this is a problem in its own right. It involves, above 
all, those ‘initial’ perturbations, the development of which 
in an expanding Universe has led in due time to the forma- 
tion of galaxies and their clusters. The question still awaits 
its final answer. New elements here are as follows: first, 
the account of the role of neutrinos if the neutrino mass is 
fairly large (say, m, ~ 10 eV = 10-8? g x 10 -°m, where 
m = 9 x 10-238 g is the electron mass); second, the possibil- 
ity is discussed that initial perturbations might be of a 
purely quantum nature, and so for them the classical theo- 
ry of gravitation near the singularity available within its 
frame is absolutely unsuitable. 

It is clear from the foregoing that the issue of the forma- 
tion of galaxies is exceedingly interesting and important; it 
is going, obviously, to remain in the focus of attention yet 
for many years. 

Origins of cosmic rays and cosmic y- and X-ray radiations. 
Another name for this rather wide domain is the astrophy- 
sics of high energies (one can also bring under this heading 
the astrophysics of high-energy neutrinos, a topic touched 
upon in the next subsection). In a number of papers I dis- 
cussed the problems of high-energy astrophysics and its out- 
looks [17, 18]. Therefore, I will only say that up until the 
close of the century we might expect to get answers to a num- 
ber of most important questions that concern cosmic rays 
in the Galaxy, the nature of cosmic rays with superhigh ener- 
gies (E > 10”-10"° eV), gamma-astronomy and some more 
narrow or special areas. By the year 2001, and especially 
by 2012 (the centenary of the discovery of cosmic rays) the 
relative contribution of high-energy astrophysics to astro- 
nomical sciencies, although not small even nowadays, will, 
no doubt, be still higher. 

Neutrino astronomy. Nearly all evidence about space and 
all astronomical information are obtained by receiving elec- 
tromagnetic radiations. Since the earliest days well into 
this century instead of ‘electromagnetic radiation’ we, with- 
out a great deal of exaggeration, could say ‘light’ or even 
just ‘visible light’. The picture has changed drastically 
ever since. Astronomy has been transforming from optical 
into all-wave one. Radioastronomy and, since the last de- 
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cade, X-ray astronomy have become as importan as opti- 
cal astronomy. Submillimetre and gamma-astronomy, a yet 
more important and wide sphere, are lagging behind a bit. 
But these disciplines, it seems, will catch up with the prog- 
ress of optical: astronomy in this decade, or, and this is ab- 
solutely for sure, by the end of the century. The word ‘catch 
up’ should be understood as a figure of speech, of course. The 
importance of bringing in new radiation ranges lies specifical- 
ly in that each of them yields new, unparalleled, form of 
evidence. There is a measure of overlap, of course; but pic- 
tures of the skies obtained in radio waves, optics, X-rays and 
y-rays are different pictures. The meaning of this is quite 
clear and familiar. As an example and reminder, I could 
mention that the brightest radio source Cassiopeia A, a 
supernova remnant, in its time was not reliably noticed op- 
tically at all, and now it is only observable in best telescopes. 

Along with electromagnetic waves and meteorites we re- 
ceive charged particles from space—cosmic rays. Examina- 
tion of them is of significant astronomical and physical inte- 
rest [17, 18]. Among the known reserves are the detection 
of gravitational waves (gravitational-wave astronomy) and 
the detection of cosmic neutrinos. Neutrino astronomy is 
worth especial mentioning. 

There are already in existence three directions of neutri- 
no astronomy: solar astronomy—the detection of solar neut- 
rinos; the search for neutrino flares from supernova explo- 
sions, etc., and the neutrino astronomy of high and superhigh 
energies. 

The Sun and other ‘burning’ stars are for the most part 
powered by nucler reactions (the role of gravitational con- 
traction is normally insignificant and short-term). The nu- 
clear reactions occur, of course, within the bowels of stars, 
near the centre, where the temperature is maximal. Neutri- 
nos, however, are capable of piercing the star and this is 
their unique feature (by the way, gravitational waves pos- 
sess even higher penetrating power). As for solar neutrinos, 
they are a subject both of long-standing discussions and ar- 
duous attempts to detect them using a chlorine detector; 
which is based on the reaction *’Cl + ve > ?’Ar + e-. The 
detector is only effective for neutrinos with E, >0.81 MeV, 
which are produced on the Sun in relatively small. quantities 
mainly as a result of the decay of ®B nuclei. The flux of such 
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neutrinos is quite sensitive to the temperature within the 
Sun and to solar models in general. Therefore, it is difficult 
precisely to predict the expected flux of solar neutrinos to 
be detected by the chlorine detector. 

The last of the predicted fluxes I know of, the one ob- 
tained for more or less ‘standard’ solar models, is 642 SNU 
(SNU stands for solar neutrino unit; at a flux of 1 SNU, 
103% target nuclei, in this case *’C] nuclei, capture on average 
one neutrino a second). The latest observational data, as 
of 1983, are about 2 SNU (or rather 2.2+0.3 and 1.8 
0.4). This, about four-fold discrepancy between predic- 
tion and observation, precipitated a flood of papers. Lately 
this disagreement is often linked with hypothetical neutrino 
oscillations (mutual transformation of electron, muon and, 
perhaps, tau-neutrino ve, Vp, and v,). But I share the opinion 
of the many who believe that the reality of this discrep- 
ancy can still be held in question, since we here deal with 
the neutrino fluxes that are highly sensitive to models of 
the solar interior. On the contrary, these models have prac- 
tically no influence on the total flux of solar neutrinos, 
which (on Earth) is 6.1 x 10 cm™?. s-! and it is caused by 
the reaction p + p —> d + e* + ve (note that the flux of 
neutrinos produced by the above-mentioned reaction 
8B — !Be + et + ve only amounts to 10°cm-?. s-}), 

Neutrinos from the (p + p)-reaction can be recorded suc- 
cessfully using the gallium detector ("Ga isotope transforms 
into Ge, the threshold energy for neutrinos being only 
0.23 MeV; the energy of neutrinos from (p + p)-reaction can 
be as high as 0.42 MeV). Gallium measurements call for a 
detector of mass 30-50 t, and the gallium is practically pre- 
served. Other detectors can also be employed (Li, ‘In, 
etc.). Unfortunately, we will have to wait several years for 
the new detectors to become operative. On the whole, how- 
ever, the astronomy of solar neutrinos has already been born. 

Both the chlorine installation and some other detectors 
are capable of recording neutrinos that must emerge in stel- 
lar collapses in the Galaxy (neutrinos produced by some nu- 
clear reactions with E, < 10 MeV). But no such events have 
been registered yet. This attests that collapses in the ga- 
lactic disc occur on average once in several years (this esti- 
mate agrees with others). It is likely that in the future neut- 
rino flares will be observed on collapsing stars. 
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Finally, some words about the neutrino astronomy of 
high energies. 

Neutrinos of high energies, say more than 10° eV, are 
only produced by the proton-nuclear component of cosmic 
rays. In this respect, the situation is similar to the produc- 
tion of m°-mesons in cosmic rays (such n°-mesons, which 
quickly decay to give y-protons, are an important object 
of investigation in gamma-astronomy). There are projects, 
notably the DUMAND project, of recording high-energy neut- 
rinos underground and under water. DUMAND is designed 
to register neutrinos at a depth of about 5 km in the ocean 
by optical detectors (from the Cherenkov radiation of a cas- 
cade of charged particles produced by neutrinos). The ener- 
gy threshold of the set-up is, of course, d:«rmined by its 
size, but it must inevitably be very high—of the order of 
102-103 GeV. The angular resolution will be as good as 1°, 
i.e. this is going to be a sort of neutrino telescope. For neut- 
rinos with superhigh energy Æ, > 10°-10° GeV deep-water 
projects with acoustic registration method are discussed. 

Prime candidates among the galactic sources of high-energy 
neutrinos (EZ, œ 102-10° GeV) are young (up to one 
year) shells of supernova explosions and ‘latent’ sources— 
pulsars and black holes surrounded by thick layers of mat- 
ter. In principle, extragalactic sources can be registered as 
well: quasars, active galactic nuclei, galaxies in the forma- 
tion phase (this is assumed to have taken place at red shift 
parameters z = (A — A,)/A,u 10-30; for the most distant 
of the known quasars z < 4). We here restrict ourselves to 
the remark that from neutrino radiation and its relationship 
with the gamma-luminosity of quasars and active galactic 
nuclei one can in principle distinguish a black hole from a 
magnetoid (in the case ofa black hole surrounded with a dense 
shell, the neutrino flux can be large with the flux of y- 
rays with E, >70 MeV being small; for a magnetoid the 
gamma-flux is larger than a certain minimum computed 
from the data on neutrino flux [19]). 

Unfortunately, this method ‘works’ not for all models of 
a massive black hole, and, most important, some more years 
will have to elapse before we can realize a large-scale obser- 
vational project. 

One other problem of importance in neutrino astronomy is 
attempts at observation of relict neutrinos, which have 
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formed early in the cosmological evolution (like background 
thermal microwave radiation, but much earlier). This prob- 
lem is especially fascinating and important in connection 
with the assumption that the neutrino mass is nonzero and 
neutrinos can have an outstanding role to play in cosmology 
(for m, > 1-10 eV). I know of no real paths, however, that 
would lead to direct detection of such neutrinos. 

Decades proved to be an insufficient time for neutrino 
astronomy to establish itself, which is no wonder because ob- 
servational techniques employed are exceedingly complicat- 
ed. But I think that already in this century neutrino astro- 
nomy will become an effective and extensive area of re- 
search. 


, 


6. On Scientific Revolutions in Physics 
and Astronomy 


We have been dealing so far with more or less specific is- 
sues of physics and astrophysics from the viewpoint of their 
outlook for research up until the end of the 20th century. 
But there are, of course, problems of more general nature, 
which are concerned with the speed of development, forms, 
specific features and organization of research, and so forth. 
A discussion of the constellation of these problems as a whole 
lies beyond the framework of this article. It would be re- 
levant, however, to make some comments and, specifically, 
to touch upon the issue of the nature and contents of the no- 
tion ‘scientific revolution’. In the literature there is no con- 
sensus as to the meaning of the term ‘scientific revolution’ 
itself and appraisal of some or other ‘revolutionary’ events 
in science, in the past and at present. Naturally, people al- 
so think about the future. Moreover, no matter how inter- 
esting is the history of science in itself, its objective study 
is important and justified, it seems, precisely with the aim, 
proceeding from the past and present, to look into the fu- 
ture and in a way to foresee this future. Specifically, we are 
willing to perceive whether we are having now a revolution 
in physics and astronomy, and if so, what its content is, 
how it develops and what we are to expect in the future, and 
so on. 
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I do not dare say that those disputes about scientific re- 
volutions are really of importance. For a physicist’s ear 
much in them sounds scholastic. But I somehow found my- 
self involved into one of the discussions when I was ap- 
proached with the request to write a review [20] of the Russian 
translation of the well-known book by T. Kuhn [21]. What 
is more, I have held for some time (and written about it) 
that in astronomy only two revolutions took place. The first 
one, associated with the name of Galileo, was brought about 
by the coming of the telescope; and the second one is taking 
place these days with optical astronomy becoming all-wave. 
This view seems to me to be one-sided and in need of revi- 
sion. And so the following are some remarks concerning 
scientific revolutions (for more details see [20, 22]). 

Kuhn argues [21]: ‘A revolution is for me a special sort 
of change involving a certain sort of reconstruction of group 
commitments. But it need not be a large change, nor need it 
seem revolutionary to those outside a single community, 
consisting perhaps of fewer than twenty-five people.’ Such 
an interpretation seems to me to be a storm in a teacup. Of 
course, as is always with terminology, it is impossible to 
forbid, or prove unfounded, nearly any definition. Between 
a storm in the ocean and in a teacup there are an infinite 
number of cases that differ in their scope (a storm in a gulf, 
in a lake, in a pond, in a swimming-pool, and so on). Revo- 
lutions in science can, and this is done, be divided into glo- 
bal, local, microscopical ones, and so forth. In the final ana- 
lysis, one can use the term scientific revolution virtually 
for any jump or ‘reconstruction of commitments’. But this 
approach can hardly be considered expedient. The notion 
of true revolution intuitively brings to mind a great break- 
through or an upheaval of major proportions. It would be 
quite natural, therefore, to reserve the term ‘scientific re- 
solution’ for a profound change in natural sciences or just 
in one scientific discipline (physics, astronomy, biology). 
Any further fitting of revolutions to a Procrustean bed of 
hierarchical rungs (global, local, etc.) would hardly bring 
us anywhere. The real task, which is of some interest, con- 
cerns concrete analysis of concrete histories and situations 
in science. 

Viewing revolutions in physics and astronomy from this 
angle I would dare make some comments. So, in astronomy, 
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barring its early establishment in the antiquity (the astro- 
nomical system of Hipparchus-Ptolemy), we have had only 
two revolutions so far. The first occurred in the 16th and 
17th centuries; it is primarily concerned with Copernicus 
and Galileo. Clearly, it involved the transition from geocent- 
ric views to heliocentric views and Galileo’s breakthrough 
in observational techniques (the use of the telescope), which 
caused an avalanche of remarkable discoveries. The second 
revolution in astronomy has occurred (and, generally speak- 
ing. it is still in progress) in our century, and it is also two- 
fold. On the one hand, worthy of mention here are the discov- 
ery of the nonstationary nature of the Universe and the gen- 
eral establishment of extragalactic astronomy. On the 
other hand, the second astronomical revolution consisted 
in optical astronomy becoming all-wave astronomy. As in 
Galileo’s days, this brought about a number of fine discov- 
eries already mentioned above. 

With this approach (or, if you like, classification) the 
duration of each revolution becomes somewhat longer. But 
the picture becomes more consistent and, most important, 
agrees with historical facts. 

Let us now turn to a similar classification in physics. If 
again we for the moment forget about antiquity, we might 
single out two revolutions in physics. One of them is con- 
cerned with the creation of classical mechanics (Coperni- 
cus, Galileo, Kepler, and especially Newton) and of the en- 
tire foundation of the edifice that is generally referred to as 
classical physics. Nobody would belittle the significance 
and grandeur of the achievements of Faraday, Maxwell and 
other fathers of electrodynamics, but, as it seemed in the 
last century, electrodynamics is in accord with classical 
mechanics. Therefore, it would be relevant to associate the 
second revolution in physics only with the creation of the 
theory of relativity and quantum mechanics. The time spans 
are generally disputable. I think that the second revolution 
in physics should be placed at 1900-32. 

The foregoing is rather trivial; I suspect, many think this 
way. But this opinion is in sharp contrast with that of Kuhn, 
that is why I thought it expedient to voice it here. 

If we stick to this opinion, it would be only natural to ask 
about the development of physics to-day. Do we jlive be- 
tween revolutions, or perhaps a third revolution is there? 
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Such questions are not without interest. A number of com- 
ments in this connection are made in my paper [22]. 

I use specific examples there in an attempt to demonstrate 
the fallacy of Kuhn’s scheme with its ‘change of para- 
digms’, ‘normal science’, ‘anomalies’, ‘extraordinary stud- 
ies’, etc., etc., with reference to a wide variety of processes 
in science in the past and at present. The only exceptions are 
the above-mentioned true (in the above sense) scientific 
revolutions on a large scale. Other, more modest, develop- 
ments in science should, perhaps, be referred to as struggles 
of various hypotheses, competition of research programs. 

I believe that an important, if not prevalent, factor, exert- 
ing its influence on the character (style, forms) of the pres- 
ent and future development of science, is accumulated ex- 
perience. In fact, as we know well from the history of sci- 
ence, the transition from the geocentric to heliocentric system 
or from antique mechanics to the classical (Newtonian) one 
occurred at a time dominated by scholasticism and metaphy- 
sics, and the growth of science was hindered by religious 
dogmatism, and so on. Late in the 19th century and early 
in the 20th century the dialectical understanding of the de- 
velopment of science did not establish itself yet, scientific 
thinking was dominated by mechanicism. New concepts and 
any constraints on the applicability of age-old notions were 
perceived sometimes as a kind of collapse, the ‘disappearance 
of matter’, and so forth. But these days our understanding 
that knowledge being relative in no way contradicts its 
objective, or even absolute, nature, the understanding of the 
correspondence principle for transitions from old theories to 
new ones, and the understanding of the dialectical character 
of the development of science and the process of cognition— 
are all common knowledge. Put another way, dramatic changes 
have occurred in the climate in which evolves science, 
in which live and work those who are responsible for the ad- 
vancement of science. 

Hadrons have been replaced as ‘fundamental elements’ of 
the Universe by quarks and gluons. But hardly anybody 
would on these grounds maintain that physics was collaps- 
ing, as it has been when the mechanistic picture of the 
world changed to modern thinking, or, at least, to electron 
theory. And it is hardly relevant to treat hadron concepts 
as one paradigm and quark concepts as another, with all 
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the implications of a change of paradigms. These considera- 
tions, which can easily be pursued, make me think (or at any 
rate assume) that the present-day and further evolution of 
physics and astronomy has in general passed the phase or 
phases of establishment that have been characterized by 
large-scale and drastic (global) upheavals, which have been 
called above ‘true scientific revolutions’. In other words, 
the character of the development of physics and astronomy 
from about the mid-20th century has changed in many re- 
spects. This is in part caused by the accumulated past expe- 
rience, and by the wide spread in science of the method of 
dialectical analysis of evolutionary processes. Another rea- 
son is the change of scope of scientific work (in lieu of soli- 
tary scholars we now have an army of scientific workers) 
with the concomitant sharp increase in the rate of informa- 
tion exchange and the growth of intimate everyday links of 
most scientific disciplines with industries, agriculture, 
medicine—in a word, with the many aspects of the life of 
the human society. 

Now we turn to one other important qualitative point, 
which concerns the state-of-the-art of physics and astrono- 
my. In the past when sciences were greedily accumulating 
‘initial’ knowledge, generally at a relatively low level of 
understanding of the laws of nature, scientists were used to 
be exposed to the kaleidoscope of concepts, to their unbound- 
ed growth and evolution. Beyond doubt, scientific cogni- 
tion remains and will remain in the future essentially unli- 
mited. In this sense, ‘finitism’, understood as the existence 
of some limit in the development of science, does not hold 
water. But does this mean to suggest that everything is rel- 
ative, that some laws and concepts keep being replaced by 
others all the time, the new candidates differing in a funda- 
mental way from their earlier counterparts? It is hard to 
agree with such an ‘infinitism’, which one encounters from 
time to time. Yes, instead of hadrons we now deal with 
quarks, perhaps even protoquarks will come next. But is it 
natural to think that the splitting of matter is not an infinite 
process and, if not quarks, then protoquarks will appear to 
be ‘final’ (tiniest) building blocks of the Universe? In any 
event, such a possibility is not at variance with anything. 
If it is realized, this in no way means the end of physics, 
since an infinite variety of questions will still remain. We 
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should not overlook the fact that the hitting of some ‘bot- 
tom’, the establishment of the limit of division of matter, 
if any, will be an important stage in the history of physics— 
it will change the character of its further development in 
the most profound and significant way. It is quite probable 
that we are already close to the answer to this question. 

Let us take another, simpler example. We know to-day 
that astronomical information can come to us from distant 
space in the form of electromagnetic wavesof various ranges, 
in the form of cosmic rays (charged particles), neutrino, 
and gravitational waves. By the end of the century all these 
information channels will, in all probability, be utilized 
in practice, and further on they will only expand. This will 
complete the remarkable process, we mentioned repeatedly 
(it is characteristic of the 20th century!), of astronomy becom- 
ing all-wave. In addition, we also have the astrophysics 
of cosmic rays, neutrino astronomy and gravitational-wave 
astronomy. Now there are no concrete grounds for believing 
that some new, yet unknown channels of astronomical in- 
formation will make their appearance. Logically, it is as 
possible for this to happen as it is for new channels to be 
nonexistent. I, for one (and many others, perhaps), am of the 
latter opinion. In such a case, if new channels do not come 
in actual fact (since there are none in nature), the implica- 
tions will be considerable for astronomy in the 21st century, 
and in later centuries for that matter. 

I see how incomplete are those considerations that are 
contained in the last several remarks. It has been my aim, 
however, not so much to state anything but rather to invite 
a discussion (it is practically under way already) of the pos- 
sible changes of the very nature of science in the foreseeable 
future. 


7. Concluding Remarks 


In 1938, when I graduated from the Physical Department of 
Moscow University, or in 1940, when I joined the department 
of theoretical physics at the Lebedev Physical Institute ot 
the USSR Academy of Sciences, physics in many respects 
was as it is now. Quantum mechanics, the theory of relativ- 
ity, let alone electrodynamics and statistical physics, have 
remained now the same, although in teaching these dis- 
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ciplines new forms of representation of material and espe- 
cially new examples have to be used. In 40 years, and even 
more so in 15 years, from now university physics (and phys- 
ics in general) may and even must be taught from the same 
principles, which had more or less established themselves 
by the mid-20th century. Modifications, and significant at 
that, are concerned, for one thing, with the fundamentals 
of microphysics (quantum theory of fields and particles, and 
so on); for the other, with the numerous physical domains 
(experimental high-energy physics, solid-state physics, es- 
pecially theoretical, physical kinetics, plasma physics, acous- 
tics, optics and spectroscopy, including laser spectrosco- 
py, and so on). It is here, in specific (sometimes highly nar- 
row) branches and ‘twigs’ of the mighty tree of physics (to 
use lofty style) that took place the main events, main 
growth, and expansion of physics. To be sure, by ‘main 
events’ I mean developments that are important not in the 
sense of their significance for physics as the science of the 
‘nature of things’ (in this respect the transition from had- 
rons to quarks is much more important), but in the sense of 
the status of physics in natural sciences and the life of the 
human society. 

The quantitative growth that has taken place is such that 
it already changes the situation qualitatively. In those re- 
mote years I recalled at the beginning of this section (1938- 
41) it was sufficient for me once a weak to call at the library 
and during an hour or two to look through, quite attentive- 
ly, a dozen, or less, relatively thin journals, new arrivals. 
In the theoretical department headed by I. E. Tamm of the 
Lebedev Physical Institute of the USSR Academy of Sci- 
ences worked five people. We all knew, of course, who does 
what, we got together often and discussed a host of ques- 
tions, we argued much and we all were members of a weekly 
seminar. In Moscow at that time there was only one kindred 
group of theoreticians, about that strong, headed by 
L. D. Landau. We met from time to time. At the time I 
was 25, now I am 70, and an aberration of memory is quite 
possible; perhaps what changed was not so much the state 
of affairs in science as the man himself (in this case myself). 
I think, however, that it is not this that matters. The main 
thing is that now our theoretical department includes not 
five but 50 workers. Therefore, even in my official capacity 
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(I bear the honorable, but not easy, burden of being the head 
of the department) I am in no position to be aware of the 
interests of the majority of my subordinates. Needless to 
say, it is even more difficult in relation to other theoreti- 
cians in Moscow. Further, I chair two weekly seminars. I 
receive 30-40 contents of journals and with a considerable 
delay (sometimes unavoidable) I receive xeroxed papers 
from the All-Union Institute of Scientific and Technica) 
Information. And so I have to sort them out, look through 
them and distribute them at seminars—an additional toll 
on my time. Another concern is dozens of Soviet and foreign 
journals. As a result, a sea of journals, reprints and preprints 
literally inundates a scientific worker these days. 

The many years of experience only help me to keep my 
head above water in this sea and to be of some help for 
others. Most importantly, I can, however superficially, scan 
a wide spectrum of problems. But where can I find the time 
for work, those quiet hours that are so needed for this? 
But this is my personal concern, because I am not so ‘typ- 
ical’, and the 21st century is for me an abstraction. And 
what lies in stock for the physicists who are now 22 or 23 
and who are at the beginning of their career in science and 
who are to enter the next century in their prime? They may 
have received splendid training (we have many topflight 
professors, reputable universities, colleges, and depart- 
ments, a wealth of journals and other literature). But the 
very logic of life of these young men, the conditions of their 
work urge them to become narrow specialists. Popular ex- 
cuses or absenteeism at seminars are shortage of time and 
need to keep up with the literature on ‘one’s topic’. But the 
net result is often not some time saved but some loss of a 
broad physical picture, which has a boomerang effect in any 
narrower field. Older generation physicists should remember 
this and fight the above-mentioned negative trends. 

It is from this angle that I view my activities concerned 
with compiling and commenting the ‘list’ of key problems 
[1-3]. But this problem is in need of much more systematie 
attention (within the framework of the Academy of Sciences, 
Soviet and world scientific community). As a positive exam- 
ple of the global approach I would like to refer with grati- 
tude to the activities of the International Astronomical 
Union (IAU). The Union has brought under its umbrella 
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all astronomers in the world and is fostering the internation- 
al astronomical community. Unfortunately, there is no 
such thing as the world’s community of physicists. A physi- 
cist by education and experience, I at any rate go in for as- 
trophysics as a ‘by-profession’. And so I see clearly how more 
close are the ties between astronomers than between phys- 
icists, especially between workers in different disciplines. 

I cannot expand on this topic here. But for years I have 
been achingly dissatisfied with the meagre trickle of books 
and papers by professional physicists devoted to at least 
some of the problems just discussed. Historians of science 
write more and their activity is on the whole useful, but 
they normally ‘lag in phase’ and do not feel the ‘pulse of 
life’ in to-day’s physics. And how can you possibly in such 
situation evaluate the present-day picture and look into the 
future? 

In the latter respect I have perhaps been not too successful 
as well. But nevertheless I visualive (maybe this is just 
a mirage!) a wide steel rail track, not ideal perhaps, along 
which rolls a long train inhabited with physicists and their 
assistants, with their devices, journals, dreams, fates ... . 
The train has just passed the mark 1987, ahead lies the plat- 
form ‘1990’, and farther on, nearby and distant at the same 
time, is the cherished station ‘2000’, a gate to the next cen- 
tury. 
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